Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????????O	00111111001111110011111100111111001111110011111100111111001111110011111101001111	3f3f3f3f3f3f3f3f3f4f
SJIS-WIN	語??誼?Ⅴ儒??O	1000110011101010001111110011111110001011011000100011111110000111010110001000111011110010001111110011111101001111	8cea3f3f8b623f87588ef23f3f4f
EUC-JP	語??誼??儒??O	10111000111011000011111100111111101101011100001100111111001111111011110011110100001111110011111101001111	b8ec3f3fb5c33f3fbcf43f3f4f
UTF-8	語뤴뫖誼믭Ⅴ儒몄졋O	11101000101010101001111011101011101001001011010011101011101010111001011011101000101010101011110011101011101011111010110111100010100001011010010011100101100001001001001011101011101010101000010011101100101000011000101101001111	e8aa9eeba4b4ebab96e8aabcebafade285a4e58492ebaa84eca18b4f
UHC	語뤴뫖誼믭Ⅴ儒몄졋O	11100101110111101000111111100010100100011011100011101011111111101001001011101111101001011011010011101010111000111011100011101100101000001011101001001111	e5de8fe291b8ebfe92efa5b4eae3b8eca0ba4f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)