To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 獄??揖??怨?? 100011011001011000111111001111111001011101001011001111110011111110001001100001010011111100111111 8d963f3f974b3f3f89853f3f
EUC-JP 獄??揖??怨?? 101110011111011000111111001111111100110110101100001111110011111110110001111001010011111100111111 b9f63f3fcdac3f3fb1e53f3f
UTF-8 獄뷜뫖揖닸궇怨살춲 111001111000110110000100111010111011011110011100111010111010101110010110111001101000111110010110111010111000101110111000111010101011011010000111111001101000000010101000111011001000001010110100111011001011011010110010 e78d84ebb79cebab96e68f96eb8bb8eab687e680a8ec82b4ecb6b2
UHC 獄뷜뫖揖닸궇怨살춲 111010001010101110111010111000101001000110111000111010111110011110110100111001101000001010100000111010101011001110111011111011001010110110001110 e8abbae291b8ebe7b4e682a0eab3bbecad8e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)