To what bit string is a character (or characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嶸??唯??臾??B 11111010101101000011111100111111100101110100001000111111001111111110010001101011001111110011111101000010 fab43f3f97423f3fe46b3f3f42
EUC-JP 嶸??唯??臾??B 1000111110111011111101000011111100111111110011011010001100111111001111111110011111001100001111110011111101000010 8fbbf43f3fcda33f3fe7cc3f3f42
UTF-8 嶸뗫쵉唯묊츐臾뗫닊B 11100101101101101011100011101011100101111010101111101100101101011000100111100101100101001010111111101011101011001000101011101100101110001001000011101000100001111011111011101011100101111010101111101011100010111000101001000010 e5b6b8eb97abecb589e594afebac8aecb890e887beeb97abeb8b8a42
UHC 嶸뗫쵉唯묊츐臾뗫닊B 11100111101011101000101111101011101011001000101111101010111001101001000111100111101011101000101111101011101011001000101111101011100010001001000101000010 e7ae8bebac8beae691e7ae8bebac8beb889142

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
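
A minimal sketch of how rows like those in the table could be computed, written in Python. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is replacing unencodable characters with `?` (byte 3f), which is inferred from the 3f bytes in the table. The helper name `encode_row` is hypothetical, and the tool's own implementation may differ.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec.

    Characters the codec cannot represent are replaced with '?' (0x3f);
    this replacement policy is an assumption about the tool's behavior.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row("B", codec)
        print(label, "B", bits, hex_string)
```

For the ASCII letter `B`, every charset listed here yields the single byte 42, which matches the trailing `42` in each row of the table.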