Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | 鄭?魚?ｄ縡???蘖 | 100100110100000100111111100010111001101100111111100000101000010011100011011100010011111100111111001111111001111101010000 | 93413f8b9b3f8284e3713f3f3f9f50
EUC-JP | 鄭?魚?ｄ縡???蘖 | 110001011010001000111111101101011111101100111111101000111110010011100101110100100011111100111111001111111101110110110001 | c5a23fb5fb3fa3e4e5d23f3f3fddb1
UTF-8 | 鄭렏魚편ｄ縡댄렰렢蘖 | 111010011000010010101101111010111010000010001111111010011010110110011010111011011000111010111000111011111011110110000100111001111011100010100001111010111000110010000100111010111010000010110000111010111010000010100010111010001001100010010110 | e984adeba08fe9ad9aed8eb8efbd84e7b8a1eb8c84eba0b0eba0a2e89896
UHC | 鄭렏魚편ｄ縡댄렰렢蘖 | 1110111111110111100011101010010111100101111000001100011011101101101000111110010011101110101011011011010011101101100011101011110110001110101100111110010111101110 | eff78ea5e5e0c6eda3e4eeadb4ed8ebd8eb3e5ee
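
A table like this one can be approximated offline with a short script. The following is only a sketch in Python, not the code behind this page: the function name `encode_in_charsets` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the rows above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (characters, binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (raw.decode(codec), binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (chars, binary, hexa) in encode_in_charsets("鄭ｄ").items():
        print(label, chars, binary, hexa)
```

Individual bytes may differ from the table for characters where Python's codecs and the converter's charset tables disagree, so treat the output as a cross-check rather than a reference.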

SJIS-WIN, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).