Character and Charcode - Check how computer recognize characters

To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????????????????	00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN	??弔?猥?伊豆?鎖????嚼??熊?	0011111100111111100100101010001000111111111000001100111000111111100010001100100110010011101001000011111110001101101111010011111100111111001111110011111110011010100100000011111100111111100011000100011000111111	3f3f92a23fe0ce3f88c993a43f8dbd3f3f3f3f9a903f3f8c463f
EUC-JP	??弔?猥?伊豆?鎖?雩??嚼??熊?	00111111001111111100010010100100001111111110000011010000001111111011000011001011110001101010011000111111101110101011111100111111100011111110011011111010001111110011111111010011111100000011111100111111101101111010011100111111	3f3fc4a43fe0d03fb0cbc6a63fbabf3f8fe6fa3f3fd3f03f3fb7a73f
UTF-8	亐렕弔렟猥렧伊豆뤈鎖떠雩컣룬嚼咽ㅤ熊녘	111001001011101010010000111010111010000010010101111001011011110010010100111010111010000010011111111001111000110010100101111010111010000010100111111001001011110010001010111010001011000110000110111010111010010010001000111010011000111010010110111010111001011010100000111010011001101110101001111011001011101110100011111010111010001110101100111001011001101010111100111011111010011010011110111000111000010110100100111001111000011010001010111010111000010110011000	e4ba90eba095e5bc94eba09fe78ca5eba0a7e4bc8ae8b186eba488e98e96eb96a0e99ba9ecbba3eba3ace59abcefa69ee385a4e7868aeb8598
UHC	亐렕弔렟猥렧伊豆뤈鎖떠雩컣룬嚼咽ㅤ熊녘	1110101010100111100011101010101011110000110000001000111010110000111010001110010110001110101101101110110010100101110101001110011110001111101110001110000111110000101101101011000011101001111011001011000010001110101101111110100111101101110001001110011011101100101001001101010011101010101010001011001111101000	eaa78eaaf0c08eb0e8e58eb6eca5d4e78fb8e1f0b6b0e9ecb08eb7e9edc4e6eca4d4eaa8b3e8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)