To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ?鴦頸??鴦頸?^ 00111111111010011111000111101000111100100011111100111111111010011111000111101000111100100011111101011110 3fe9f1e8f23f3fe9f1e8f23f5e
EUC-JP ?鴦頸??鴦頸?^ 00111111111100101111001111110000111101000011111100111111111100101111001111110000111101000011111101011110 3ff2f3f0f43f3ff2f3f0f43f5e
UTF-8 뤙鴦頸뫙뤙鴦頸뫙^ 11101011101001001001100111101001101101001010011011101001101000001011100011101011101010111001100111101011101001001001100111101001101101001010011011101001101000001011100011101011101010111001100101011110 eba499e9b4a6e9a0b8ebab99eba499e9b4a6e9a0b8ebab995e
UHC 뤙鴦頸뫙뤙鴦頸뫙^ 1000111111001000111001001110110011001100111100101011100011111101100011111100100011100100111011001100110011110010101110001111110101011110 8fc8e4ecccf2b8fd8fc8e4ecccf2b8fd5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)