To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 竪息族奪遜揃竪息族奪遜揃^ 10010010010001111001000110100111100100011011000010010010010001001001000110111011100100011011010110010010010001111001000110100111100100011011000010010010010001001001000110111011100100011011010101011110 924791a791b0924491bb91b5924791a791b0924491bb91b55e
EUC-JP 竪息族奪遜揃竪息族奪遜揃^ 11000011101010001100001010101001110000101011001011000011101001011100001010111101110000101011011111000011101010001100001010101001110000101011001011000011101001011100001010111101110000101011011101011110 c3a8c2a9c2b2c3a5c2bdc2b7c3a8c2a9c2b2c3a5c2bdc2b75e
UTF-8 竪息族奪遜揃竪息族奪遜揃^ 11100111101010111010101011100110100000011010111111100110100101111000111111100101101001011010101011101001100000011001110011100110100011111000001111100111101010111010101011100110100000011010111111100110100101111000111111100101101001011010101011101001100000011001110011100110100011111000001101011110 e7abaae681afe6978fe5a5aae9819ce68f83e7abaae681afe6978fe5a5aae9819ce68f835e
UHC 竪息族奪遜?竪息族奪遜?^ 1110001010110101111000111101001111110000111010011111011110101100111000011110000100111111111000101011010111100011110100111111000011101001111101111010110011100001111000010011111101011110 e2b5e3d3f0e9f7ace1e13fe2b5e3d3f0e9f7ace1e13f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)