To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鸚?????幽??B 111010100101111100111111001111110011111100111111001111111001011101001000001111110011111101000010 ea5f3f3f3f3f3f97483f3f42
EUC-JP 鸚??荑??幽??B 1111001111000000001111110011111110001111110101111111100100111111001111111100110110101001001111110011111101000010 f3c03f3f8fd7f93f3fcda93f3f42
UTF-8 鸚룸뜆荑뗥ㅇ幽됱퓢B 11101001101110001001101011101011101000111011100011101011100111001000011011101000100011011001000111101011100101111010010111100011100001011000011111100101101110011011110111101011100100001011000111101101100100111010001001000010 e9b89aeba3b8eb9c86e88d91eb97a5e38587e5b9bdeb90b1ed93a242
UHC 鸚룸뜆荑뗥ㅇ幽됱퓢B 11100101101001001011011111101011100011011000100111101100101111111000101111100101101001001011011111101010111010111000100111101100101111111000101101000010 e5a4b7eb8d89ecbf8be5a4b7eaeb89ecbf8b42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)