Character and Charcode - Check how computer recognize characters

To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????????????????	00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN	夜??愿???る?癌??源??畑??愈	1001011011101001001111110011111110011100110000110011111100111111001111111000001011101001001111111000101011100000001111110011111110001100101110010011111100111111100101001010100000111111001111111001011011111010	96e93f3f9cc33f3f3f82e93f8ae03f3f8cb93f3f94a83f3f96fa
EUC-JP	夜??愿??瑗る?癌??源??畑??愈	11001100111010110011111100111111110110001100010100111111001111111000111111001100110000001010010011101011001111111011010011100010001111110011111110111000101110110011111100111111110010001010101000111111001111111100110011111100	cceb3f3fd8c53f3f8fccc0a4eb3fb4e23f3fb8bb3f3fc8aa3f3fccfc
UTF-8	夜좊ㅋ愿곤쫱瑗る뼍癌껊릪源덈뼍畑듬솁愈	111001011010010010011100111011001010001010001010111000111000010110001011111001101000010010111111111010101011001110100100111011001010101110110001111001111001000110010111111000111000001010001011111010111011110010001101111001111001100110001100111010101011101110001010111010111010011010101010111001101011101010010000111010111000110110001000111010111011110010001101111001111001010110010001111010111001001110101100111011001000011010000001111001101000010010001000	e5a49ceca28ae3858be684bfeab3a4ecabb1e79197e3828bebbc8de7998ceabb8aeba6aae6ba90eb8d88ebbc8de79591eb93acec8681e68488
UHC	夜좊ㅋ愿곤쫱瑗る뼍癌껊릪源덈뼍畑듬솁愈	1110010110101000101000001110101110100100101110111110101010110100101100001110111110100110100010011110101010111100101010101110101110010110100101011110010011011111100000111110101110010000100011001110101010111001100010001110101110010110100101011110111110100101101101011110101110011001100001101110101011101111	e5a8a0eba4bbeab4b0efa689eabcaaeb9695e4df83eb908ceab988eb9695efa5b5eb9986eaef

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)