To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??邯⊂??あ?臣ケ???????栓??げ 00111111001111111110011110110110100000011011110000111111001111111000001010100000001111111001000001100010100000110101000000111111001111110011111100111111001111110011111100111111100100001111000000111111001111111000001010110000 3f3fe7b681bc3f3f82a03f906283503f3f3f3f3f3f3f90f03f3f82b0
EUC-JP ??邯⊂??あ?臣ケ???????栓??げ 00111111001111111110111010111000101000101011111000111111001111111010010010100010001111111011111111000011101001011011000100111111001111110011111100111111001111110011111100111111110000001111001000111111001111111010010010110010 3f3feeb8a2be3f3fa4a23fbfc3a5b13f3f3f3f3f3f3fc0f23f3fa4b2
UTF-8 룴창邯⊂룴횕あ룵臣ケ룵쨵▣룶뇟熉룫栓룵卽げ 111010111010001110110100111011001011000010111101111010011000001010101111111000101000101010000010111010111010001110110100111011011001101010010101111000111000000110000010111010111010001110110101111010001000011110100011111000111000001010110001111010111010001110110101111011001010100010110101111000101001011010100011111010111010001110110110111010111000011110011111111001111000011010001001111010111010001110101011111001101010000010010011111010111010001110110101111001011000110110111101111000111000000110010010 eba3b4ecb0bde982afe28a82eba3b4ed9a95e38182eba3b5e887a3e382b1eba3b5eca8b5e296a3eba3b6eb879fe78689eba3abe6a093eba3b5e58dbde38192
UHC 룴창邯⊂룴횕あ룵臣ケ룵쨵▣룶뇟熉룫栓룵卽げ 100011111010100111000011101000101100101011111011101000011111100010001111101010011100001110001111101010101010001010001111101010101110001111101101101010111011000110001111101010101010010010001111101000101100001110001111101010111011010010100001111010011111101110001111101000101110111011111011100011111010101011110001111011011010101010110010 8fa9c3a2cafba1f88fa9c38faaa28faae3edabb18faaa48fa2c38fabb4a1e9fb8fa2eefb8faaf1edaab2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)