Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????????	00111111001111110011111100111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f3f3f3f
SJIS-WIN	底??底??嚥??底	1001001011101010001111110011111110010010111010100011111100111111100110101000101100111111001111111001001011101010	92ea3f3f92ea3f3f9a8b3f3f92ea
EUC-JP	底??底??嚥??底	1100010011101100001111110011111111000100111011000011111100111111110100111110101100111111001111111100010011101100	c4ec3f3fc4ec3f3fd3eb3f3fc4ec
UTF-8	底쇽스底억슝嚥드ㅁ底	111001011011101010010101111011001000011110111101111011001000101010100100111001011011101010010101111011001001011010110101111011001000101010011101111001011001101010100101111010111001001110011100111000111000010110000001111001011011101010010101	e5ba95ec87bdec8aa4e5ba95ec96b5ec8a9de59aa5eb939ce38581e5ba95
UHC	底쇽스底억슝嚥드ㅁ底	1110111010111100101111001110111110111101101110101110111010111100101111101110111110111101101110011110011010111111101101011110010110100100101100011110111010111100	eebcbcefbdbaeebcbeefbdb9e6bfb5e5a4b1eebc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)