Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????B 0011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f42
SJIS-WIN ??咫枇コ∫◇B 00111111001111111001101001000000100101001111100010000011010100101000000111100111100000011001111001000010 3f3f9a4094f8835281e7819e42
EUC-JP ??咫枇コ∫◇B 00111111001111111101001110100001110010001111101010100101101100111010001011101001101000011111111001000010 3f3fd3a1c8faa5b3a2e9a1fe42
UTF-8 룶끝咫枇コ∫◇B 11101011101000111011011011101011100000011001110111100101100100101010101111100110100111101000011111100011100000101011001111100010100010001010101111100010100101111000011101000010 eba3b6eb819de592abe69e87e382b3e288abe2978742
UHC 룶끝咫枇コ∫◇B 100011111010101110110011101000011111001010100001110111011110110110101011101100111010000111110010101000011101111001000010 8fabb3a1f2a1ddedabb3a1f2a1de42

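SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (hex 3f) in the Character column marks a character that the charset cannot represent.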
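
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The function name encode_to_bits and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions made for illustration, and Python's codecs may differ from the page's converter for some vendor-specific characters. Unencodable characters are replaced with "?", which matches the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "コB"
    for label, codec in CHARSETS.items():
        binary, hexa = encode_to_bits(sample, codec)
        print(label, binary, hexa)
```

For the sample "コB", the UTF-8 row should read e382b342 in hexadecimal, which agrees with the e382b3 and 42 byte groups in the UTF-8 row of the table above.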