To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ?眩雋?眩雋^ 0011111111100001101111111110100010110010001111111110000110111111111010001011001001011110 3fe1bfe8b23fe1bfe8b25e
EUC-JP 峴眩雋峴眩雋^ 100011111011101111000001111000101100000111110000101101001000111110111011110000011110001011000001111100001011010001011110 8fbbc1e2c1f0b48fbbc1e2c1f0b45e
UTF-8 峴眩雋峴眩雋^ 11100101101100111011010011100111100111001010100111101001100110111000101111100101101100111011010011100111100111001010100111101001100110111000101101011110 e5b3b4e79ca9e99b8be5b3b4e79ca9e99b8b5e
UHC 峴眩雋峴眩雋^ 11111010110101101111101011011111111100011110011011111010110101101111101011011111111100011110011001011110 fad6fadff1e6fad6fadff1e65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)