To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???蘖??蟯??^ 001111110011111100111111100111110101000000111111001111111110010110110010001111110011111101011110 3f3f3f9f503f3fe5b23f3f5e
EUC-JP ???蘖??蟯??^ 001111110011111100111111110111011011000100111111001111111110101010110100001111110011111101011110 3f3f3fddb13f3feab43f3f5e
UTF-8 麗닻㉦蘖띷객蟯룡릫^ 11101111101001101000100011101011100010111011101111100011100010011010011011101000100110001001011011101011100111011011011111101010101100001001110111101000100111111010111111101011101000111010000111101011101001101010101101011110 efa688eb8bbbe389a6e89896eb9db7eab09de89fafeba3a1eba6ab5e
UHC 麗닻㉦蘖띷객蟯룡릫^ 11100110101100001011010011101001101010001011011111100101111011101000110111100110101100001011010011101001101010001011011111100110100100001000110101011110 e6b0b4e9a8b7e5ee8de6b0b4e9a8b7e6908d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)