To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰???與齬???? 10001001100000010011111100111111001111111110010001101111111010101001011100111111001111110011111100111111 89813f3f3fe46fea973f3f3f3f
EUC-JP 堰???與齬???? 10110001111000010011111100111111001111111110011111010000111100111111011100111111001111110011111100111111 b1e13f3f3fe7d0f3f73f3f3f3f
UTF-8 堰묐쓷行與齬잓黎쀬벀 111001011010000010110000111010111010110010010000111011001001001110110111111011111010100010001000111010001000100010000111111010011011110110101100111011001001111010010011111011111010011010001001111011001000000010101100111010111011001010000000 e5a0b0ebac90ec93b7efa888e88887e9bdacec9e93efa689ec80acebb280
UHC 堰묐쓷行與齬잓黎쀬벀 1110010111101000100100011110101110011101100101001111101010100001111001101010100011100101111000011001111111101001111001101011000110010111111011001001001110100110 e5e891eb9d94faa1e6a8e5e19fe9e6b197ec93a6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)