To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪淡孫誰遜多巽短贈誰遜卒辿束損誰遜損竪湛 10010010010001111001001001010111100100011011011110010010010011101001000110111011100100011011110110010010010001101001001001011010100100011010000110010010010011101001000110111011100100011011001010010010010010001001000110101001100100011011100110010010010011101001000110111011100100011011100110010010010001111001001001011000 9247925791b7924e91bb91bd9246925a91a1924e91bb91b2924891a991b9924e91bb91b992479258
EUC-JP 竪淡孫誰遜多巽短贈誰遜卒辿束損誰遜損竪湛 11000011101010001100001110111000110000101011100111000011101011111100001010111101110000101011111111000011101001111100001110111011110000101010001111000011101011111100001010111101110000101011010011000011101010011100001010101011110000101011101111000011101011111100001010111101110000101011101111000011101010001100001110111001 c3a8c3b8c2b9c3afc2bdc2bfc3a7c3bbc2a3c3afc2bdc2b4c3a9c2abc2bbc3afc2bdc2bbc3a8c3b9
UTF-8 竪淡孫誰遜多巽短贈誰遜卒辿束損誰遜損竪湛 111001111010101110101010111001101011011110100001111001011010110110101011111010001010101010110000111010011000000110011100111001011010010010011010111001011011011110111101111001111001111110101101111010001011010010001000111010001010101010110000111010011000000110011100111001011000110110010010111010001011111010111111111001101001110110011111111001101001000010001101111010001010101010110000111010011000000110011100111001101001000010001101111001111010101110101010111001101011100110011011 e7abaae6b7a1e5adabe8aab0e9819ce5a49ae5b7bde79fade8b488e8aab0e9819ce58d92e8bebfe69d9fe6908de8aab0e9819ce6908de7abaae6b99b
UHC 竪淡孫誰遜多巽短贈誰遜卒?束損誰遜損竪湛 111000101011010111010011101111111110000111011101111000101100000111100001111000011101001011111101111000011101111011010011101011011111000111111100111000101100000111100001111000011111000011101111001111111110000111010110111000011101111111100010110000011110000111100001111000011101111111100010101101011101001111000000 e2b5d3bfe1dde2c1e1e1d2fde1ded3adf1fce2c1e1e1f0ef3fe1d6e1dfe2c1e1e1e1dfe2b5d3c0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)