To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???松??鴉??B 001111110011111100111111100011111011110000111111001111111110100111101011001111110011111101000010 3f3f3f8fbc3f3fe9eb3f3f42
EUC-JP 倻??松??鴉??B 1000111110110001111101100011111100111111101111101011111000111111001111111111001011101101001111110011111101000010 8fb1f63f3fbebe3f3ff2ed3f3f42
UTF-8 倻뽨펿松덂쫿鴉롡늿B 11100101100000001011101111101011101111011010100011101101100011101011111111100110100111011011111011101011100011011000001011101100101010111011111111101001101101001000100111101011101000011010000111101011100010101011111101000010 e580bbebbda8ed8ebfe69dbeeb8d82ecabbfe9b489eba1a1eb8abf42
UHC 倻뽨펿松덂쫿鴉롡늿B 11100101101001101001011011100100101111001000111011100001111001101000100011100101101001101001011011100100101111001000111011100010100010001000100001000010 e5a696e4bc8ee1e688e5a696e4bc8ee2888842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)