To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??嗽?6鼇γ? 1001011101010001001111110011111110011010011101010011111110000010010101011110101010000111100000111100000100111111 97513f3f9a753f8255ea8783c13f
EUC-JP 猷??嗽?6鼇γ? 1100110110110010001111110011111111010011110101100011111110100011101101101111001111100111101001101100001100111111 cdb23f3fd3d63fa3b6f3e7a6c33f
UTF-8 猷띰쩃嗽덈6鼇γ궕 1110011110001100101101111110101110011101101100001110110010101001100000111110010110010111101111011110101110001101100010001110111110111100100101101110100110111100100001111100111010110011111010101011011010010101 e78cb7eb9db0eca983e597bdeb8d88efbc96e9bc87ceb3eab695
UHC 猷띰쩃嗽덈6鼇γ궕 111010111010001110110110111011111010010010011101111000011111010110001000111010111010001110110110111010001010100010100101111000111000001010101010 eba3b6efa49de1f588eba3b6e8a8a5e382aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)