What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 臟??章?臟??章? 1110010001100110001111110011111110001111110011010011111111100100011001100011111100111111100011111100110100111111 e4663f3f8fcd3fe4663f3f8fcd3f
EUC-JP 臟??章?臟??章? 1110011111000111001111110011111110111110110011110011111111100111110001110011111100111111101111101100111100111111 e7c73f3fbecf3fe7c73f3fbecf3f
UTF-8 臟뚲궙章덥臟뚲궙章덥 111010001000011110011111111010111001101010110010111010101011011010011001111001111010101110100000111010111000110110100101111010001000011110011111111010111001101010110010111010101011011010011001111001111010101110100000111010111000110110100101 e8879feb9ab2eab699e7aba0eb8da5e8879feb9ab2eab699e7aba0eb8da5
UHC 臟뚲궙章덥臟뚲궙章덥 1110110111110100100011001110111010000010101011101110110111110001101101001111111011101101111101001000110011101110100000101010111011101101111100011011010011111110 edf48cee82aeedf1b4feedf48cee82aeedf1b4fe
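The rows above can be approximated with a short Python sketch. This is not necessarily how this page computes its output: the `CODECS` mapping from the page's charset labels to Python codec names is an assumption (in particular, `cp932` for SJIS-WIN and `cp949` for UHC), the helper names `encode_row` and `encode_table` are hypothetical, and Python's codec tables may differ from the page's in a few edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the `3f` bytes in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal bit string) for text in codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("章").items():
        print(label, binary, hexadecimal)
```

For example, `encode_row("章", "utf-8")` returns `("111001111010101110100000", "e7aba0")`, the same three bytes that appear for 章 in the UTF-8 row.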

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).