To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ?荊?宏?荊?槐^ 00111111100011000111010000111111100011010100011100111111100011000111010000111111100111101100010101011110 3f8c743f8d473f8c743f9ec55e
EUC-JP ?荊?宏?荊?槐^ 00111111101101111101010100111111101110011010100000111111101101111101010100111111110111001100011101011110 3fb7d53fb9a83fb7d53fdcc75e
UTF-8 뤗荊퍗宏뤗荊퍗槐^ 11101011101001001001011111101000100011011000101011101101100011011001011111100101101011101000111111101011101001001001011111101000100011011000101011101101100011011001011111100110101001111001000001011110 eba497e88d8aed8d97e5ae8feba497e88d8aed8d97e6a7905e
UHC 뤗荊퍗宏뤗荊퍗槐^ 1000111111000111111110111010101010111011100011101100111011011011100011111100011111111011101010101011101110001110110011101101100101011110 8fc7fbaabb8ecedb8fc7fbaabb8eced95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)