What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????nB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6e42
SJIS-WIN シセシオシセ狢シムシセシマシセシカシセシォnB 101111001011111010111100101101011011110010111110111000001100000010111100110100011011110010111110101111001100111110111100101111101011110010110110101111001011111010111100101010110110111001000010 bcbebcb5bcbee0c0bcd1bcbebccfbcbebcb6bcbebcab6e42
EUC-JP シセシオシセ狢シムシセシマシセシカシセシォnB 1000111010111100100011101011111010001110101111001000111010110101100011101011110010001110101111101110000011000010100011101011110010001110110100011000111010111100100011101011111010001110101111001000111011001111100011101011110010001110101111101000111010111100100011101011011010001110101111001000111010111110100011101011110010001110101010110110111001000010 8ebc8ebe8ebc8eb58ebc8ebee0c28ebc8ed18ebc8ebe8ebc8ecf8ebc8ebe8ebc8eb68ebc8ebe8ebc8eab6e42
UTF-8 シセシオシセ狢シムシセシマシセシカシセシォnB 1110111110111101101111001110111110111101101111101110111110111101101111001110111110111101101101011110111110111101101111001110111110111101101111101110011110001011101000101110111110111101101111001110111110111110100100011110111110111101101111001110111110111101101111101110111110111101101111001110111110111110100011111110111110111101101111001110111110111101101111101110111110111101101111001110111110111101101101101110111110111101101111001110111110111101101111101110111110111101101111001110111110111101101010110110111001000010 efbdbcefbdbeefbdbcefbdb5efbdbcefbdbee78ba2efbdbcefbe91efbdbcefbdbeefbdbcefbe8fefbdbcefbdbeefbdbcefbdb6efbdbcefbdbeefbdbcefbdab6e42
UHC ?????????????????????nB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6e42

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
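
The page does not say how the conversion is implemented, so the following is only a minimal sketch of how rows like those in the table could be produced in Python. The codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC, and `to_bit_strings` is a hypothetical helper name. The sketch also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Halfwidth katakana SHI (U+FF7C) followed by ASCII "nB".
    sample = "\uff7cnB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For the halfwidth katakana U+FF7C this yields `bc` under `cp932`, `8ebc` under `euc_jp`, and `efbdbc` under `utf-8`, matching the leading bytes of the corresponding rows in the table.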