To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??㎝姨??夷??恁レ???????夷?? 001111110011111110000111011100001001101101001000001111110011111110001000110011100011111100111111100111001000110010000011100011000011111100111111001111110011111100111111001111110011111110001000110011100011111100111111 3f3f87709b483f3f88ce3f3f9c8c838c3f3f3f3f3f3f3f88ce3f3f
EUC-JP ???姨??夷??恁レ???????夷?? 0011111100111111001111111101010110101001001111110011111110110000110100000011111100111111110101111110110010100101111011000011111100111111001111110011111100111111001111110011111110110000110100000011111100111111 3f3f3fd5a93f3fb0d03f3fd7eca5ec3f3f3f3f3f3f3fb0d03f3f
UTF-8 梨뚯㎝姨랁슀夷⑹콡恁レ쮷梨숈쮼吏뺤콐夷붿쮷 111011111010011110100010111010111001101010101111111000111000111010011101111001011010011110101000111010111001111010000001111011001000101010000000111001011010010010110111111000101001000110111001111011001011110110100001111001101000000110000001111000111000001110101100111011001010111010110111111011111010011110100010111011001000100010001000111011001010111010111100111011111010011110011110111010111011101010100100111011001011110110010000111001011010010010110111111010111011011010111111111011001010111010110111 efa7a2eb9aafe38e9de5a7a8eb9e81ec8a80e5a4b7e291b9ecbda1e68181e383acecaeb7efa7a2ec8888ecaebcefa79eebbaa4ecbd90e5a4b7ebb6bfecaeb7
UHC 梨뚯㎝姨랁슀夷⑹콡恁レ쮷梨숈쮼吏뺤콐夷붿쮷 111011001011000110001100111011001010011110101111111011001010100110001101111011011001101010010011111011001010100010101001111011001011000110011001111011001111011010101011111011001010100010010100111011001011000110011001111011001010100010011000111011001010011110010101111011001011000110001100111011001010100010010100111011001010100010010100 ecb18ceca7afeca98ded9a93eca8a9ecb199ecf6abeca894ecb199eca898eca795ecb18ceca894eca894

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)