Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????f?????^}Y?????f?????^}bE 0011111100111111001111110011111100111111011001100011111100111111001111110011111100111111010111100111110101011001001111110011111100111111001111110011111101100110001111110011111100111111001111110011111101011110011111010110001001000101 3f3f3f3f3f663f3f3f3f3f5e7d593f3f3f3f3f663f3f3f3f3f5e7d6245
SJIS-WIN 鄭???蠱f鄭???蠱^}Y鄭???蠱f鄭???蠱^}bE 10010011010000010011111100111111001111111110010111000001011001101001001101000001001111110011111100111111111001011100000101011110011111010101100110010011010000010011111100111111001111111110010111000001011001101001001101000001001111110011111100111111111001011100000101011110011111010110001001000101 93413f3f3fe5c16693413f3f3fe5c15e7d5993413f3f3fe5c16693413f3f3fe5c15e7d6245
EUC-JP 鄭???蠱f鄭???蠱^}Y鄭???蠱f鄭???蠱^}bE 11000101101000100011111100111111001111111110101011000011011001101100010110100010001111110011111100111111111010101100001101011110011111010101100111000101101000100011111100111111001111111110101011000011011001101100010110100010001111110011111100111111111010101100001101011110011111010110001001000101 c5a23f3f3feac366c5a23f3f3feac35e7d59c5a23f3f3feac366c5a23f3f3feac35e7d6245
UTF-8 鄭렏뤋내蠱f鄭렏뤋내蠱^}Y鄭렏뤋내蠱f鄭렏뤋내蠱^}bE 111010011000010010101101111010111010000010001111111010111010010010001011111010111000001010110100111010001010000010110001011001101110100110000100101011011110101110100000100011111110101110100100100010111110101110000010101101001110100010100000101100010101111001111101010110011110100110000100101011011110101110100000100011111110101110100100100010111110101110000010101101001110100010100000101100010110011011101001100001001010110111101011101000001000111111101011101001001000101111101011100000101011010011101000101000001011000101011110011111010110001001000101 e984adeba08feba48beb82b4e8a0b166e984adeba08feba48beb82b4e8a0b15e7d59e984adeba08feba48beb82b4e8a0b166e984adeba08feba48beb82b4e8a0b15e7d6245
UHC 鄭렏뤋내蠱f鄭렏뤋내蠱^}Y鄭렏뤋내蠱f鄭렏뤋내蠱^}bE 11101111111101111000111010100101100011111011101110110011101110111100110111001100011001101110111111110111100011101010010110001111101110111011001110111011110011011100110001011110011111010101100111101111111101111000111010100101100011111011101110110011101110111100110111001100011001101110111111110111100011101010010110001111101110111011001110111011110011011100110001011110011111010110001001000101 eff78ea58fbbb3bbcdcc66eff78ea58fbbb3bbcdcc5e7d59eff78ea58fbbb3bbcdcc66eff78ea58fbbb3bbcdcc5e7d6245

SJIS-Win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
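
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charset labels above (the mapping of UHC to `cp949` in particular is an assumption), and that characters a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the 3f runs in the table. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec; return (bytes, binary string, hex string)."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return raw, bits, raw.hex()


def encode_table(text):
    """Build one row per charset, keyed by the label used in the table."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (raw, bits, hexstr) in encode_table("鄭f").items():
        print(label, bits, hexstr)
```

Because each row is computed independently from the same input string, differences between rows reflect only how each charset represents (or fails to represent) the characters.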