To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????B | 001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f42 |
| SJIS-WIN | 嘆短嘆狸嘆短嘆狸B | 1001001001010001100100100101101010010010010100011001001001001011100100100101000110010010010110101001001001010001100100100100101101000010 | 9251925a9251924b9251925a9251924b42 |
| EUC-JP | 嘆短嘆狸嘆短嘆狸B | 1100001110110010110000111011101111000011101100101100001110101100110000111011001011000011101110111100001110110010110000111010110001000010 | c3b2c3bbc3b2c3acc3b2c3bbc3b2c3ac42 |
| UTF-8 | 嘆短嘆狸嘆短嘆狸B | 11100101100110001000011011100111100111111010110111100101100110001000011011100111100010111011100011100101100110001000011011100111100111111010110111100101100110001000011011100111100010111011100001000010 | e59886e79fade59886e78bb8e59886e79fade59886e78bb842 |
| UHC | 嘆短嘆狸嘆短嘆狸B | 1111011110100011110100111010110111110111101000111101011111100001111101111010001111010011101011011111011110100011110101111110000101000010 | f7a3d3adf7a3d7e1f7a3d3adf7a3d7e142 |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
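
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the codec choices are assumptions: Python's `cp932` stands in for SJIS-WIN and `cp949` for UHC, and they may differ from this tool's tables for some rarely used characters. Characters that a charset cannot represent are replaced with `?`, which matches the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932, as stated in the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code = code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("嘆短嘆狸嘆短嘆狸B").items():
        print(label, bits, hexa)
```

Each character occupies one byte in ISO-8859-1, two bytes in SJIS-WIN, EUC-JP, and UHC, and three bytes in UTF-8 for these kanji, which is why the UTF-8 bit string is the longest.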