To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 誰遜村誰遜遜巽賊袖誰遜村誰遜遜巽賊促^ 10010010010011101001000110111011100100011011101010010010010011101001000110111011100100011011101110010010010001101001000110101111100100011011001110010010010011101001000110111011100100011011101010010010010011101001000110111011100100011011101110010010010001101001000110101111100100011010001101011110 924e91bb91ba924e91bb91bb924691af91b3924e91bb91ba924e91bb91bb924691af91a35e
EUC-JP 誰遜村誰遜遜巽賊袖誰遜村誰遜遜巽賊促^ 11000011101011111100001010111101110000101011110011000011101011111100001010111101110000101011110111000011101001111100001010110001110000101011010111000011101011111100001010111101110000101011110011000011101011111100001010111101110000101011110111000011101001111100001010110001110000101010010101011110 c3afc2bdc2bcc3afc2bdc2bdc3a7c2b1c2b5c3afc2bdc2bcc3afc2bdc2bdc3a7c2b1c2a55e
UTF-8 誰遜村誰遜遜巽賊袖誰遜村誰遜遜巽賊促^ 11101000101010101011000011101001100000011001110011100110100111011001000111101000101010101011000011101001100000011001110011101001100000011001110011100101101101111011110111101000101100111000101011101000101000101001011011101000101010101011000011101001100000011001110011100110100111011001000111101000101010101011000011101001100000011001110011101001100000011001110011100101101101111011110111101000101100111000101011100100101111111000001101011110 e8aab0e9819ce69d91e8aab0e9819ce9819ce5b7bde8b38ae8a296e8aab0e9819ce69d91e8aab0e9819ce9819ce5b7bde8b38ae4bf835e
UHC 誰遜村誰遜遜巽賊袖誰遜村誰遜遜巽賊促^ 11100010110000011110000111100001111101011011110111100010110000011110000111100001111000011110000111100001110111101110111011100100111000101100000011100010110000011110000111100001111101011011110111100010110000011110000111100001111000011110000111100001110111101110111011100100111101011011010101011110 e2c1e1e1f5bde2c1e1e1e1e1e1deeee4e2c0e2c1e1e1f5bde2c1e1e1e1e1e1deeee4f5b55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)