To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 蔭????????蔭????????淫 ̄h 10001000111111000011111100111111001111110011111100111111001111110011111100111111100010001111110000111111001111110011111100111111001111110011111100111111001111111000100011111010100000010101000001101000 88fc3f3f3f3f3f3f3f3f88fc3f3f3f3f3f3f3f3f88fa815068
EUC-JP 蔭????????蔭????????淫 ̄h 10110000111111100011111100111111001111110011111100111111001111110011111100111111101100001111111000111111001111110011111100111111001111110011111100111111001111111011000011111100101000011011000101101000 b0fe3f3f3f3f3f3f3f3fb0fe3f3f3f3f3f3f3f3fb0fca1b168
UTF-8 蔭띾졎溜띿뀓溜삳젶蔭띾졎溜쀬뀓溜븍졏淫 ̄h 11101000100101001010110111101011100111011011111011101100101000011000111011101111101001111000101111101011100111011011111111101011100000001001001111101111101001111000101111101100100000101011001111101100101000001011011011101000100101001010110111101011100111011011111011101100101000011000111011101111101001111000101111101100100000001010110011101011100000001001001111101111101001111000101111101011101110001000110111101100101000011000111111100110101101111010101111101111101111111010001101101000 e894adeb9dbeeca18eefa78beb9dbfeb8093efa78bec82b3eca0b6e894adeb9dbeeca18eefa78bec80aceb8093efa78bebb88deca18fe6b7abefbfa368
UHC 蔭띾졎溜띿뀓溜삳젶蔭띾졎溜쀬뀓溜븍졏淫 ̄h 1110101111100011100011011110101110100000101110111110101011111110100011011110110010000101100011011110101011111110101110111110101110100000101010101110101111100011100011011110101110100000101110111110101011111110100101111110110010000101100011011110101011111110101110101110101110100000101111001110101111100010101000111111111001101000 ebe38deba0bbeafe8dec858deafebbeba0aaebe38deba0bbeafe97ec858deafebaeba0bcebe2a3fe68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)