To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 臟?珥?衆?第 1110010001100110001111111110000011100000001111111000111101001111001111111001000111100110 e4663fe0e03f8f4f3f91e6
EUC-JP 臟?珥?衆?第 1110011111000111001111111110000011100010001111111011110110110000001111111100001011101000 e7c73fe0e23fbdb03fc2e8
UTF-8 臟렞珥렮衆렲第 111010001000011110011111111010111010000010011110111001111000111110100101111010111010000010101110111010001010000110000110111010111010000010110010111001111010110010101100 e8879feba09ee78fa5eba0aee8a186eba0b2e7acac
UHC 臟렞珥렮衆렲第 1110110111110100100011101010111111101100101101001000111010111011111100011110101110001110101111111111000010101111 edf48eafecb48ebbf1eb8ebff0af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)