What bit string is a character (or characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?蒡???肢????????班頸?????^ 0011111111100100111011100011111100111111001111111000111010001000001111110011111100111111001111110011111100111111001111110011111110010100110001111110100011110010001111110011111100111111001111110011111101011110 3fe4ee3f3f3f8e883f3f3f3f3f3f3f3f94c7e8f23f3f3f3f3f5e
EUC-JP ?蒡???肢???芚????班頸?????^ 00111111111010001111000000111111001111110011111110111011111010000011111100111111001111111000111111010111101110110011111100111111001111110011111111001000110010011111000011110100001111110011111100111111001111110011111101011110 3fe8f03f3f3fbbe83f3f3f8fd7bb3f3f3f3fc8c9f0f43f3f3f3f3f5e
UTF-8 뤊蒡쨵붜롆肢렎뤉톻芚폀붜샴뤊班頸텕쨵뭍롏렯^ 11101011101001001000101011101000100100101010000111101100101010001011010111101011101101101001110011101011101000011000011011101000100000101010001011101011101000001000111011101011101001001000100111101101100001101011101111101000100010101001101011101101100011111000000011101011101101101001110011101100100000111011010011101011101001001000101011100111100011111010110111101001101000001011100011101101100001011001010111101100101010001011010111101011101011011000110111101011101000011000111111101011101000001010111101011110 eba48ae892a1eca8b5ebb69ceba186e882a2eba08eeba489ed86bbe88a9aed8f80ebb69cec83b4eba48ae78fade9a0b8ed8595eca8b5ebad8deba18feba0af5e
UHC 뤊蒡쨵붜롆肢렎뤉톻芚폀붜샴뤊班頸텕쨵뭍롏렯^ 10001111101110101101101110111100101001001000111110111010110110111000111011001100111100101011011010001110101001001000111110111001101101111000111011010100111011001011110010001111101110101101101110111100101001001000111110111010110110101110110011001100111100101011011010001110101001001000111110111001101101111000111011010101100011101011110001011110 8fbadbbca48fbadb8eccf2b68ea48fb9b78ed4ecbc8fbadbbca48fbadaecccf2b68ea48fb9b78ed58ebc5e

SJIS-Win, EUC-JP: Classic charsets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
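
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The names `CHARSETS`, `encode_row` and `encode_table` are hypothetical, the codec names are Python's own, and it assumes that UHC corresponds to Python's `cp949` codec and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: SJIS-WIN -> cp932, UHC -> cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("班^").items():
        print(name, bits, hex_string)
```

For the character 班, this sketch yields `e78fad` in UTF-8, `94c7` in SJIS-WIN, `c8c9` in EUC-JP and `3f` in ISO-8859-1, which agrees with the corresponding bytes in the rows above.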