What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????i????iB 0011111100111111001111110011111101101001001111110011111100111111001111110110100101000010 3f3f3f3f693f3f3f3f6942
SJIS-WIN 章?臟魄i章?臟魄iB 1000111111001101001111111110010001100110111010011010111001101001100011111100110100111111111001000110011011101001101011100110100101000010 8fcd3fe466e9ae698fcd3fe466e9ae6942
EUC-JP 章?臟魄i章?臟魄iB 1011111011001111001111111110011111000111111100101011000001101001101111101100111100111111111001111100011111110010101100000110100101000010 becf3fe7c7f2b069becf3fe7c7f2b06942
UTF-8 章렍臟魄i章렍臟魄iB 111001111010101110100000111010111010000010001101111010001000011110011111111010011010110110000100011010011110011110101011101000001110101110100000100011011110100010000111100111111110100110101101100001000110100101000010 e7aba0eba08de8879fe9ad8469e7aba0eba08de8879fe9ad846942
UHC 章렍臟魄i章렍臟魄iB 11101101111100011000111010100011111011011111010011011011110111100110100111101101111100011000111010100011111011011111010011011011110111100110100101000010 edf18ea3edf4dbde69edf18ea3edf4dbde6942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
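
The conversion in the table can be approximated with a short Python sketch. This is not the converter's own code. The codec names in CHARSETS are assumptions about which Python codecs correspond to each label (cp932 for SJIS-WIN, cp949 for UHC), and the function name to_bit_strings is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above. Python's mapping tables may differ from the converter's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "iB"  # any character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Because "i" and "B" are ASCII, all five charsets should encode them to the same two bytes (69 42). The rows only diverge for the non-ASCII characters.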