To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 銑遜遜銑遜遜B 10010001010011001001000110111011100100011011101110010001010011001001000110111011100100011011101101000010 914c91bb91bb914c91bb91bb42
EUC-JP 銑遜遜銑遜遜B 11000001101011011100001010111101110000101011110111000001101011011100001010111101110000101011110101000010 c1adc2bdc2bdc1adc2bdc2bd42
UTF-8 銑遜遜銑遜遜B 11101001100010101001000111101001100000011001110011101001100000011001110011101001100010101001000111101001100000011001110011101001100000011001110001000010 e98a91e9819ce9819ce98a91e9819ce9819c42
UHC 銑遜遜銑遜遜B 11100000110101011110000111100001111000011110000111100000110101011110000111100001111000011110000101000010 e0d5e1e1e1e1e0d5e1e1e1e142
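The rows above can be approximated offline with a short Python sketch. The codec names on the right-hand side of CHARSETS are Python's standard codec names, and pairing them with this tool's labels (for example SJIS-WIN with cp932 and UHC with cp949) is an assumption, as are the helper names encode_row and encode_table. The tool's own implementation is not shown here and may differ for characters outside the common repertoire.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the charset cannot represent are replaced with "?" (0x3f),
    # which matches what the ISO-8859-1 row shows for Japanese input.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("銑遜遜銑遜遜B").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.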

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).