What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蹄???終???蹄???種?應? 100100101111101100111111001111110011111110001111010010010011111100111111001111111001001011111011001111110011111100111111100011101110110100111111100111001110010000111111 92fb3f3f3f8f493f3f3f92fb3f3f3f8eed3f9ce43f
EUC-JP 蹄???終???蹄???種?應? 110001001111110100111111001111110011111110111101101010100011111100111111001111111100010011111101001111110011111100111111101111001110111100111111110110001110011000111111 c4fd3f3f3fbdaa3f3f3fc4fd3f3f3fbcef3fd8e63f
UTF-8 蹄뀜렰렲終뀜렰렕蹄뀜렰렲種렟應렜 111010001011100110000100111010111000000010011100111010111010000010110000111010111010000010110010111001111011010110000010111010111000000010011100111010111010000010110000111010111010000010010101111010001011100110000100111010111000000010011100111010111010000010110000111010111010000010110010111001111010100010101110111010111010000010011111111001101000011110001001111010111010000010011100 e8b984eb809ceba0b0eba0b2e7b582eb809ceba0b0eba095e8b984eb809ceba0b0eba0b2e7a8aeeba09fe68789eba09c
UHC 蹄뀜렰렲終뀜렰렕蹄뀜렰렲種렟應렜 1111000010110100101100101111000110001110101111011000111010111111111100001111101110110010111100011000111010111101100011101010101011110000101101001011001011110001100011101011110110001110101111111111000011111010100011101011000011101011111010111000111010101110 f0b4b2f18ebd8ebff0fbb2f18ebd8eaaf0b4b2f18ebd8ebff0fa8eb0ebeb8eae

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
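
The kind of conversion shown in the table can be reproduced with a minimal Python sketch. It is not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, and so is the function name `encode_in_charsets`. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (bits, raw.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_in_charsets("蹄").items():
        print(label, bits, hexstr)
```

Calling `encode_in_charsets` with a single character makes it easy to compare byte lengths across charsets: a kanji takes two bytes in SJIS-Win and EUC-JP but three in UTF-8.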