To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??裕??幽???ル?倚??兢???飮?? 1001100011011010001111110011111110010111010101000011111100111111100101110100100000111111001111110011111110000011100010110011111110011000110111110011111100111111100110010101110100111111001111110011111110011111010110100011111100111111 98da3f3f97543f3f97483f3f3f838b3f98df3f3f995d3f3f3f9f5a3f3f
EUC-JP 俑??裕??幽???ル?倚??兢???飮?? 1101000011011100001111110011111111001101101101010011111100111111110011011010100100111111001111110011111110100101111010110011111111010000111000010011111100111111110100011011111000111111001111110011111111011101101110110011111100111111 d0dc3f3fcdb53f3fcda93f3f3fa5eb3fd0e13f3fd1be3f3f3fddbb3f3f
UTF-8 俑앹늿裕녻굜幽껊겱曆ル뿭倚볟슫兢履덃껸飮귥㉫ 111001001011111110010001111011001001010110111001111010111000101010111111111010001010001110010101111010111000010110111011111010101011010110011100111001011011100110111101111010101011101110001010111010101011001010110001111011111010011010001011111000111000001110101011111010111011111110101101111001011000000010011010111010111011001110011111111011001000101010101011111001011000010110100010111011111010011110011111111010111000110110000011111010101011101110111000111010011010001110101110111010101011011110100101111000111000100110101011 e4bf91ec95b9eb8abfe8a395eb85bbeab59ce5b9bdeabb8aeab2b1efa68be383abebbfade5809aebb39fec8aabe585a2efa79feb8d83eabbb8e9a3aeeab7a5e389ab
UHC 俑앹늿裕녻굜幽껊겱曆ル뿭倚볟슫兢履덃껸飮귥㉫ 1110100110110101100111011110110010001000100010001110101110101110100001101110100010000010100001001110101011101011100000111110101110000001101111011110011010110111101010111110101110010111101011011110101111101111100100111110010110011010101101001101000011100111111011001010101010001000111001101011001010111001111010111110011010000010111011001010100010111100 e9b59dec8888ebae86e88284eaeb83eb81bde6b7abeb97adebef93e59ab4d0e7ecaa88e6b2b9ebe682eca8bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)