To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 渧雫澈自溿・ソ 111110110100100010001110101101001111101101001011100011101010100111111011010010101010010110111111 fb488eb4fb4b8ea9fb4aa5bf
EUC-JP 渧雫澈自溿・ソ 1000111111000111111010111011110010110110100011111100100011100101101111001010101110001111110010001011000110001110101001011000111010111111 8fc7ebbcb68fc8e5bcab8fc8b18ea58ebf
UTF-8 渧雫澈自溿・ソ 111001101011100010100111111010011001101110101011111001101011111010001000111010001000011110101010111001101011101010111111111011111011110110100101111011111011110110111111 e6b8a7e99babe6be88e887aae6babfefbda5efbdbf
UHC ??澈自??? 001111110011111111110100110011011110110110111011001111110011111100111111 3f3ff4cdedbb3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)