To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?LCznf?LCzn^}Y?LCznf?LCzn^}bE 0011111101001100010000110111101001101110011001100011111101001100010000110111101001101110010111100111110101011001001111110100110001000011011110100110111001100110001111110100110001000011011110100110111001011110011111010110001001000101 3f4c437a6e663f4c437a6e5e7d593f4c437a6e663f4c437a6e5e7d6245
SJIS-WIN 衷LCznf衷LCzn^}Y衷LCznf衷LCzn^}bE 100100101000111101001100010000110111101001101110011001101001001010001111010011000100001101111010011011100101111001111101010110011001001010001111010011000100001101111010011011100110011010010010100011110100110001000011011110100110111001011110011111010110001001000101 928f4c437a6e66928f4c437a6e5e7d59928f4c437a6e66928f4c437a6e5e7d6245
EUC-JP 衷LCznf衷LCzn^}Y衷LCznf衷LCzn^}bE 110000111110111101001100010000110111101001101110011001101100001111101111010011000100001101111010011011100101111001111101010110011100001111101111010011000100001101111010011011100110011011000011111011110100110001000011011110100110111001011110011111010110001001000101 c3ef4c437a6e66c3ef4c437a6e5e7d59c3ef4c437a6e66c3ef4c437a6e5e7d6245
UTF-8 衷LCznf衷LCzn^}Y衷LCznf衷LCzn^}bE 11101000101000011011011101001100010000110111101001101110011001101110100010100001101101110100110001000011011110100110111001011110011111010101100111101000101000011011011101001100010000110111101001101110011001101110100010100001101101110100110001000011011110100110111001011110011111010110001001000101 e8a1b74c437a6e66e8a1b74c437a6e5e7d59e8a1b74c437a6e66e8a1b74c437a6e5e7d6245
UHC 衷LCznf衷LCzn^}Y衷LCznf衷LCzn^}bE 111101011111101101001100010000110111101001101110011001101111010111111011010011000100001101111010011011100101111001111101010110011111010111111011010011000100001101111010011011100110011011110101111110110100110001000011011110100110111001011110011111010110001001000101 f5fb4c437a6e66f5fb4c437a6e5e7d59f5fb4c437a6e66f5fb4c437a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)