To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 瘟??擁??言??B 11100001100010010011111100111111100101110110100100111111001111111000110010111110001111110011111101000010 e1893f3f97693f3f8cbe3f3f42
EUC-JP 瘟??擁??言??B 11100001111010010011111100111111110011011100101000111111001111111011100011000000001111110011111101000010 e1e93f3fcdca3f3fb8c03f3f42
UTF-8 瘟룡뜆擁녘갬言됭떨B 11100111100110001001111111101011101000111010000111101011100111001000011011100110100100111000000111101011100001011001100011101010101100001010110011101000101010001000000011101011100100001010110111101011100101101010100001000010 e7989feba3a1eb9c86e69381eb8598eab0ace8a880eb90adeb96a842
UHC 瘟룡뜆擁녘갬言됭떨B 11101000101100001011011111100110100011011000100111101000101101101011001111101000101100001011011111100101111010111000100111101000101101101011001101000010 e8b0b7e68d89e8b6b3e8b0b7e5eb89e8b6b342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)