To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 閾搾、宋閾搾、宋B 111010001000011110001101111011111010010010010001011101101110100010000111100011011110111110100100100100010111011001000010 e8878defa49176e8878defa4917642
EUC-JP 閾搾、宋閾搾、宋B 1110111111100111101110101111000110001110101001001100000111010111111011111110011110111010111100011000111010100100110000011101011101000010 efe7baf18ea4c1d7efe7baf18ea4c1d742
UTF-8 閾搾、宋閾搾、宋B 11101001100101101011111011100110100100001011111011101111101111011010010011100101101011101000101111101001100101101011111011100110100100001011111011101111101111011010010011100101101011101000101101000010 e996bee690beefbda4e5ae8be996bee690beefbda4e5ae8b42
UHC ?搾?宋?搾?宋B 00111111111100111011011000111111111000011110010000111111111100111011011000111111111000011110010001000010 3ff3b63fe1e43ff3b63fe1e442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)