To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰?????罐徇?????霓??泣hぜ 1000100110000001001111110011111100111111001111110011111111100011101000111001110001101101001111110011111100111111001111110011111111101000101111010011111100111111100010111000001110000010100010001000001010111010 89813f3f3f3f3fe3a39c6d3f3f3f3f3fe8bd3f3f8b83828882ba
EUC-JP 堰?????罐徇??洹??霓??泣hぜ 10110001111000010011111100111111001111110011111100111111111001101010010111010111110011100011111100111111100011111100011110111010001111110011111111110000101111110011111100111111101101011110001110100011111010001010010010111100 b1e13f3f3f3f3fe6a5d7ce3f3f8fc7ba3f3ff0bf3f3fb5e3a3e8a4bc
UTF-8 堰묐쓷流쒏씭罐徇쒒뀎洹섎뎁霓띰퐢泣hぜ 111001011010000010110000111010111010110010010000111011001001001110110111111011111010011110001010111011001001001010001111111011001001010010101101111001111011110110010000111001011011111010000111111011001001001010010010111010111000000010001110111001101011010010111001111011001000010010001110111010111000111010000001111010011001110010010011111010111001110110110000111011011001000010100010111001101011001110100011111011111011110110001000111000111000000110011100 e5a0b0ebac90ec93b7efa78aec928fec94ade7bd90e5be87ec9292eb808ee6b4b9ec848eeb8e81e99c93eb9db0ed90a2e6b3a3efbd88e3819c
UHC 堰묐쓷流쒏씭罐徇쒒뀎洹섎뎁霓띰퐢泣hぜ 1110010111101000100100011110101110011101100101001110101011111100100111001110011010011101101111101100111010111000111000101101111110011100111010011000010110001001111010101011011110011000111010111011010110101010111001111110011110110110111011111011110110001011111010111110100010100011111010001010101010111100 e5e891eb9d94eafc9ce69dbeceb8e2df9ce98589eab798ebb5aae7e7b6efbd8bebe8a3e8aabc

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
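
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's own code: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_bits are assumptions made for illustration, and Python's codecs may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Charset labels from the table mapped to Python codec names.
# These pairings are assumed equivalents, not taken from the tool itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Render each byte as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    # Hiragana "ze" (U+305C), the last character in the table rows.
    for label, codec in CODECS.items():
        bits, hex_string = encode_bits("\u305c", codec)
        print(label, bits, hex_string)
```

For the single character ぜ this yields e3819c in UTF-8, 82ba in SJIS-WIN, a4bc in EUC-JP and aabc in UHC, which agrees with the trailing bytes of the corresponding table rows, while ISO-8859-1 falls back to 3f.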