To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 狸鱈他辿遜樽狸誰村辿孫誰狸誰村辿遜樽B 10010010010010111001001001001100100100011011110010010010010010001001000110111011100100100100110110010010010010111001001001001110100100011011101010010010010010001001000110110111100100100100111010010010010010111001001001001110100100011011101010010010010010001001000110111011100100100100110101000010 924b924c91bc924891bb924d924b924e91ba924891b7924e924b924e91ba924891bb924d42
EUC-JP 狸鱈他辿遜樽狸誰村辿孫誰狸誰村辿遜樽B 11000011101011001100001110101101110000101011111011000011101010011100001010111101110000111010111011000011101011001100001110101111110000101011110011000011101010011100001010111001110000111010111111000011101011001100001110101111110000101011110011000011101010011100001010111101110000111010111001000010 c3acc3adc2bec3a9c2bdc3aec3acc3afc2bcc3a9c2b9c3afc3acc3afc2bcc3a9c2bdc3ae42
UTF-8 狸鱈他辿遜樽狸誰村辿孫誰狸誰村辿遜樽B 11100111100010111011100011101001101100011000100011100100101110111001011011101000101111101011111111101001100000011001110011100110101010001011110111100111100010111011100011101000101010101011000011100110100111011001000111101000101111101011111111100101101011011010101111101000101010101011000011100111100010111011100011101000101010101011000011100110100111011001000111101000101111101011111111101001100000011001110011100110101010001011110101000010 e78bb8e9b188e4bb96e8bebfe9819ce6a8bde78bb8e8aab0e69d91e8bebfe5adabe8aab0e78bb8e8aab0e69d91e8bebfe9819ce6a8bd42
UHC 狸?他?遜樽狸誰村?孫誰狸誰村?遜樽B 110101111110000100111111111101101110001000111111111000011110000111110001110111001101011111100001111000101100000111110101101111010011111111100001110111011110001011000001110101111110000111100010110000011111010110111101001111111110000111100001111100011101110001000010 d7e13ff6e23fe1e1f1dcd7e1e2c1f5bd3fe1dde2c1d7e1e2c1f5bd3fe1e1f1dc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)