To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ツ古渉仰篠古渉仰撮 1100001010001100110000111000111111000010100010111100001010001110110000101000110011000011100011111100001010001011110000101000111001000010 c28cc38fc28bc28ec28cc38fc28bc28e42
EUC-JP ツ古渉仰篠古渉仰撮 100011101100001010111000110001011011111011000100101101101100010010111100110001001011100011000101101111101100010010110110110001001011101110100011 8ec2b8c5bec4b6c4bcc4b8c5bec4b6c4bba3
UTF-8 ツ古渉仰篠古渉仰撮 111011111011111010000010111001011000111110100100111001101011100010001001111001001011101110110000111001111010111110100000111001011000111110100100111001101011100010001001111001001011101110110000111001101001001010101110 efbe82e58fa4e6b889e4bbb0e7afa0e58fa4e6b889e4bbb0e692ae
UHC ?古?仰篠古?仰撮 001111111100110110101111001111111110010011100110111000011100011011001101101011110011111111100100111001101111010111001001 3fcdaf3fe4e6e1c6cdaf3fe4e6f5c9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)