What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???^?????v???^?????vB 001111110011111100111111010111100011111100111111001111110011111100111111011101100011111100111111001111110101111000111111001111110011111100111111001111110111011001000010 3f3f3f5e3f3f3f3f3f763f3f3f5e3f3f3f3f3f7642
SJIS-WIN 吟?d^吟??蘖繃v吟?d^吟??蘖繃vB 10001011111000010011111110000010100001000101111010001011111000010011111100111111100111110101000011100011011111010111011010001011111000010011111110000010100001000101111010001011111000010011111100111111100111110101000011100011011111010111011001000010 8be13f82845e8be13f3f9f50e37d768be13f82845e8be13f3f9f50e37d7642
EUC-JP 吟?d^吟??蘖繃v吟?d^吟??蘖繃vB 10110110111000110011111110100011111001000101111010110110111000110011111100111111110111011011000111100101110111100111011010110110111000110011111110100011111001000101111010110110111000110011111100111111110111011011000111100101110111100111011001000010 b6e33fa3e45eb6e33f3fddb1e5de76b6e33fa3e45eb6e33f3fddb1e5de7642
UTF-8 吟㏘d^吟㏘쨩蘖繃v吟㏘d^吟㏘쨩蘖繃vB 1110010110010000100111111110001110001111100110001110111110111101100001000101111011100101100100001001111111100011100011111001100011101100101010001010100111101000100110001001011011100111101110011000001101110110111001011001000010011111111000111000111110011000111011111011110110000100010111101110010110010000100111111110001110001111100110001110110010101000101010011110100010011000100101101110011110111001100000110111011001000010 e5909fe38f98efbd845ee5909fe38f98eca8a9e89896e7b98376e5909fe38f98efbd845ee5909fe38f98eca8a9e89896e7b9837642
UHC 吟㏘d^吟㏘쨩蘖繃v吟㏘d^吟㏘쨩蘖繃vB 11101011111000011010001011100100101000111110010001011110111010111110000110100010111001001100001010111011111001011110111011011101110111100111011011101011111000011010001011100100101000111110010001011110111010111110000110100010111001001100001010111011111001011110111011011101110111100111011001000010 ebe1a2e4a3e45eebe1a2e4c2bbe5eeddde76ebe1a2e4a3e45eebe1a2e4c2bbe5eeddde7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
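
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels above to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which matches what the table shows, but edge cases may differ from this page's output.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Encode text in every charset of the table, keyed by label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("吟B").items():
        print(label, binary, hexadecimal)
```

For example, `convert("吟")["UTF-8"]` gives the hex string `e5909f`, the same three bytes that begin the UTF-8 row above, while the ISO-8859-1 entry is `3f` because that charset has no code for the character.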