To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 胡頭紋曜橋?? 100011001101001110010011101010101001011011100100100101110110101010001011101101000011111100111111 8cd393aa96e4976a8bb43f3f
EUC-JP 胡頭紋曜橋?? 101110001101010111000110101011001100110011100110110011011100101110110110101101100011111100111111 b8d5c6accce6cdcbb6b63f3f
UTF-8 胡頭紋曜橋렣렦 111010001000001110100001111010011010000010101101111001111011010010001011111001101001101110011100111001101010100110001011111010111010000010100011111010111010000010100110 e883a1e9a0ade7b48be69b9ce6a98beba0a3eba0a6
UHC 胡頭紋曜橋렣렦 1111101111010111110101001110100111011010101000111110100011111000110011101110100110001110101101001000111010110101 fbd7d4e9daa3e8f8cee98eb48eb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)