To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鷹コ????ヒ⊂????ゴ?仇??瓣 00111111001111110011111110010001111010011000001101010010001111110011111100111111001111111000001101110001100000011011110000111111001111110011111100111111100000110101001100111111100010110111011100111111001111111110000101000001 3f3f3f91e983523f3f3f3f837181bc3f3f3f3f83533f8b773f3fe141
EUC-JP ???鷹コ????ヒ⊂????ゴ?仇??瓣 00111111001111110011111111000010111010111010010110110011001111110011111100111111001111111010010111010010101000101011111000111111001111110011111100111111101001011011010000111111101101011101100000111111001111111110000110100010 3f3f3fc2eba5b33f3f3f3fa5d2a2be3f3f3f3fa5b43fb5d83f3fe1a2
UTF-8 룴절룫鷹コ룫熉룶퀛ヒ⊂룶⅝룫킃ゴ룶仇룶뇨瓣 111010111010001110110100111011001010000010001000111010111010001110101011111010011011011110111001111000111000001010110011111010111010001110101011111001111000011010001001111010111010001110110110111011011000000010011011111000111000001110010010111000101000101010000010111010111010001110110110111000101000010110011101111010111010001110101011111011011000001010000011111000111000001010110100111010111010001110110110111001001011101110000111111010111010001110110110111010111000011110101000111001111001001110100011 eba3b4eca088eba3abe9b7b9e382b3eba3abe78689eba3b6ed809be38392e28a82eba3b6e2859deba3abed8283e382b4eba3b6e4bb87eba3b6eb87a8e793a3
UHC 룴절룫鷹コ룫熉룶퀛ヒ⊂룶⅝룫킃ゴ룶仇룶뇨瓣 100011111010100111000000111111011000111110100010111010111110110110101011101100111000111110100010111010011111101110001111101010111011001110001111101010111101001010100001111110001000111110101011101010001111110110001111101000101011010010001111101010111011010010001111101010111100111011111011100011111010101110110100101000101111011111111011 8fa9c0fd8fa2ebedabb38fa2e9fb8fabb38fabd2a1f88faba8fd8fa2b48fabb48fabcefb8fabb4a2f7fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)