To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 驛??衍??驛??z驛??衍??驛??zB 111010011000001100111111001111111001111110100101001111110011111111101001100000110011111100111111011110101110100110000011001111110011111110011111101001010011111100111111111010011000001100111111001111110111101001000010 e9833f3f9fa53f3fe9833f3f7ae9833f3f9fa53f3fe9833f3f7a42
EUC-JP 驛??衍??驛??z驛??衍??驛??zB 111100011110001100111111001111111101111010100111001111110011111111110001111000110011111100111111011110101111000111100011001111110011111111011110101001110011111100111111111100011110001100111111001111110111101001000010 f1e33f3fdea73f3ff1e33f3f7af1e33f3fdea73f3ff1e33f3f7a42
UTF-8 驛득굢衍꾤뿏驛듕툋z驛득굢衍꾤뿏驛듕툋zB 111010011010100110011011111010111001001110011101111010101011010110100010111010001010000110001101111010101011111010100100111010111011111110001111111010011010100110011011111010111001001110010101111011011000100010001011011110101110100110101001100110111110101110010011100111011110101010110101101000101110100010100001100011011110101010111110101001001110101110111111100011111110100110101001100110111110101110010011100101011110110110001000100010110111101001000010 e9a99beb939deab5a2e8a18deabea4ebbf8fe9a99beb9395ed888b7ae9a99beb939deab5a2e8a18deabea4ebbf8fe9a99beb9395ed888b7a42
UHC 驛득굢衍꾤뿏驛듕툋z驛득굢衍꾤뿏驛듕툋zB 111001101011111010110101111001101000001010001001111001101110001010000100111001111001011110010100111001101011111010110101111001001011100010000011011110101110011010111110101101011110011010000010100010011110011011100010100001001110011110010111100101001110011010111110101101011110010010111000100000110111101001000010 e6beb5e68289e6e284e79794e6beb5e4b8837ae6beb5e68289e6e284e79794e6beb5e4b8837a42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
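
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the page's actual implementation. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the choice of errors="replace", which substitutes "?" (hex 3f) for characters a charset cannot represent, in line with the 3f bytes visible in the table. The names CODECS, encode_row and encode_table are hypothetical helpers introduced only for illustration.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # Unencodable characters become "?" (0x3f); this is an assumption.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("zB").items():
        print(label, bits, hex_string)
```

For ASCII-only input such as "zB", every charset above yields the same bytes (7a 42), which is why each row of the table ends in 7a42.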