What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 玉??淫??魚??B 10001011110010100011111100111111100010001111101000111111001111111000101110011011001111110011111101000010 8bca3f3f88fa3f3f8b9b3f3f42
EUC-JP 玉??淫??魚??B 10110110110011000011111100111111101100001111110000111111001111111011010111111011001111110011111101000010 b6cc3f3fb0fc3f3fb5fb3f3f42
UTF-8 玉뽪굢淫곮쵔魚됱꼤B 11100111100011101000100111101011101111011010101011101010101101011010001011100110101101111010101111101010101100111010111011101100101101011001010011101001101011011001101011101011100100001011000111101010101111001010010001000010 e78e89ebbdaaeab5a2e6b7abeab3aeecb594e9ad9aeb90b1eabca442
UHC 玉뽪굢淫곮쵔魚됱꼤B 11101000101011001001011011100110100000101000100111101011111000101000000111101000101011001001011011100101111000001000100111101100100001001000000101000010 e8ac96e68289ebe281e8ac96e5e089ec848142

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
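
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name to_bit_strings and the CHARSETS mapping are hypothetical, and the codec names are an assumption about which Python codecs correspond to the labels above (cp932 for SJIS-WIN, cp949 for UHC). Characters that a charset cannot represent are replaced with "?" (3f), which mirrors the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "玉B"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the sample "玉B", the UTF-8 line should show e78e8942 and the ISO-8859-1 line should show 3f42, which agrees with the first and last bytes of the corresponding rows in the table.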