Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 眸翦眸ヨ聶ニ翦眸翳眸翦眸ヨ聶ニ翦眸翳B 111000011100011011100011110001101110000111000110110101101110001111100001110001101110001111000110111000011100011011100011110010001110000111000110111000111100011011100001110001101101011011100011111000011100011011100011110001101110000111000110111000111100100001000010 e1c6e3c6e1c6d6e3e1c6e3c6e1c6e3c8e1c6e3c6e1c6d6e3e1c6e3c6e1c6e3c842
EUC-JP 眸翦眸ヨ聶ニ翦眸翳眸翦眸ヨ聶ニ翦眸翳B 11100010110010001110011011001000111000101100100010001110110101101110011011100011100011101100011011100110110010001110001011001000111001101100101011100010110010001110011011001000111000101100100010001110110101101110011011100011100011101100011011100110110010001110001011001000111001101100101001000010 e2c8e6c8e2c88ed6e6e38ec6e6c8e2c8e6cae2c8e6c8e2c88ed6e6e38ec6e6c8e2c8e6ca42
UTF-8 眸翦眸ヨ聶ニ翦眸翳眸翦眸ヨ聶ニ翦眸翳B 11100111100111001011100011100111101111111010011011100111100111001011100011101111101111101001011011101000100000011011011011101111101111101000011011100111101111111010011011100111100111001011100011100111101111111011001111100111100111001011100011100111101111111010011011100111100111001011100011101111101111101001011011101000100000011011011011101111101111101000011011100111101111111010011011100111100111001011100011100111101111111011001101000010 e79cb8e7bfa6e79cb8efbe96e881b6efbe86e7bfa6e79cb8e7bfb3e79cb8e7bfa6e79cb8efbe96e881b6efbe86e7bfa6e79cb8e7bfb342
UHC 眸?眸????眸?眸?眸????眸?B 11011001110000100011111111011001110000100011111100111111001111110011111111011001110000100011111111011001110000100011111111011001110000100011111100111111001111110011111111011001110000100011111101000010 d9c23fd9c23f3f3f3fd9c23fd9c23fd9c23f3f3f3fd9c23f42

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
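
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?`, which matches the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Sketch: encode a string in several charsets and show the bytes as bits and hex.
# The codec names are Python's; mapping the page's labels to them is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (display, bits, hex) for `text` encoded with `codec`."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode the bytes again to see what actually survived the round trip.
    display = raw.decode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return display, bits, raw.hex()


def encode_table(text):
    """Return {charset label: (display, bits, hex)} for every charset above."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print only the ASCII columns so the demo works on any terminal.
    for label, (_, bits, hex_string) in encode_table("眸B").items():
        print(label, bits, hex_string)
```

For the single character 眸, `encode_row("眸", "utf-8")` gives the three bytes `e79cb8`, the same leading bytes as in the UTF-8 row of the table.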