To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竪担孫誰遜属巽炭揃辰尊促誰遜村誰遜側B 10010010010001111001001001010011100100011011011110010010010011101001000110111011100100011010111010010010010001101001001001011001100100011011010110010010010000111001000110111000100100011010001110010010010011101001000110111011100100011011101010010010010011101001000110111011100100011010010001000010 9247925391b7924e91bb91ae9246925991b5924391b891a3924e91bb91ba924e91bb91a442
EUC-JP 竪担孫誰遜属巽炭揃辰尊促誰遜村誰遜側B 11000011101010001100001110110100110000101011100111000011101011111100001010111101110000101011000011000011101001111100001110111010110000101011011111000011101001001100001010111010110000101010010111000011101011111100001010111101110000101011110011000011101011111100001010111101110000101010011001000010 c3a8c3b4c2b9c3afc2bdc2b0c3a7c3bac2b7c3a4c2bac2a5c3afc2bdc2bcc3afc2bdc2a642
UTF-8 竪担孫誰遜属巽炭揃辰尊促誰遜村誰遜側B 11100111101010111010101011100110100010111000010111100101101011011010101111101000101010101011000011101001100000011001110011100101101100011001111011100101101101111011110111100111100000101010110111100110100011111000001111101000101111101011000011100101101100001000101011100100101111111000001111101000101010101011000011101001100000011001110011100110100111011001000111101000101010101011000011101001100000011001110011100101100000011011010001000010 e7abaae68b85e5adabe8aab0e9819ce5b19ee5b7bde782ade68f83e8beb0e5b08ae4bf83e8aab0e9819ce69d91e8aab0e9819ce581b442
UHC 竪?孫誰遜?巽炭?辰尊促誰遜村誰遜側B 11100010101101010011111111100001110111011110001011000001111000011110000100111111111000011101111011110111101010010011111111110010111000111111000011101110111101011011010111100010110000011110000111100001111101011011110111100010110000011110000111100001111101101011000001000010 e2b53fe1dde2c1e1e13fe1def7a93ff2e3f0eef5b5e2c1e1e1f5bde2c1e1e1f6b042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)