What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪捉則誰遜側竪担村奪孫蔵誰遜他脱即其辰奪 10010010010001111001000110101000100100011010010110010010010011101001000110111011100100011010010010010010010001111001001001010011100100011011101010010010010001001001000110110111100100011010000010010010010011101001000110111011100100011011110010010010010001011001000110100110100100011011010010010010010000111001001001000100 924791a891a5924e91bb91a49247925391ba924491b791a0924e91bb91bc924591a691b492439244
EUC-JP 竪捉則誰遜側竪担村奪孫蔵誰遜他脱即其辰奪 11000011101010001100001010101010110000101010011111000011101011111100001010111101110000101010011011000011101010001100001110110100110000101011110011000011101001011100001010111001110000101010001011000011101011111100001010111101110000101011111011000011101001101100001010101000110000101011011011000011101001001100001110100101 c3a8c2aac2a7c3afc2bdc2a6c3a8c3b4c2bcc3a5c2b9c2a2c3afc2bdc2bec3a6c2a8c2b6c3a4c3a5
UTF-8 竪捉則誰遜側竪担村奪孫蔵誰遜他脱即其辰奪 111001111010101110101010111001101000110110001001111001011000100110000111111010001010101010110000111010011000000110011100111001011000000110110100111001111010101110101010111001101000101110000101111001101001110110010001111001011010010110101010111001011010110110101011111010001001010010110101111010001010101010110000111010011000000110011100111001001011101110010110111010001000010010110001111001011000110110110011111001011000010110110110111010001011111010110000111001011010010110101010 e7abaae68d89e58987e8aab0e9819ce581b4e7abaae68b85e69d91e5a5aae5adabe894b5e8aab0e9819ce4bb96e884b1e58db3e585b6e8beb0e5a5aa
UHC 竪捉則誰遜側竪?村奪孫?誰遜他??其辰奪 111000101011010111110011101101011111011011001110111000101100000111100001111000011111011010110000111000101011010100111111111101011011110111110111101011001110000111011101001111111110001011000001111000011110000111110110111000100011111100111111110100001110110011110010111000111111011110101100 e2b5f3b5f6cee2c1e1e1f6b0e2b53ff5bdf7ace1dd3fe2c1e1e1f6e23f3fd0ecf2e3f7ac

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A "?" (0x3f) in the Character column means the character cannot be represented in that charset.
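
The table above can be reproduced offline with a few lines of Python. This is only a sketch, not the converter's own implementation (which is not shown on this page). It assumes that Python's codec names `cp932` and `cp949` correspond to the SJIS-WIN and UHC rows, and that unencodable characters are replaced with "?" as in the ISO-8859-1 row. The function name `encode_report` is made up for this example.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (displayed text, binary string, hex string)}."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" turns unencodable characters into "?" (0x3f).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again so replaced characters show up as "?".
        shown = data.decode(codec)
        report[label] = (shown, bits, data.hex())
    return report


if __name__ == "__main__":
    for label, (shown, bits, hexed) in encode_report("竪").items():
        print(label, shown, bits, hexed)
```

For the first character in the table, 竪, this gives e7abaa for UTF-8, 9247 for SJIS-WIN, c3a8 for EUC-JP and 3f for ISO-8859-1, matching the start of each row above.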