To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シセシツシセシネ湿疾湿爵湿実湿゚諶シシラ 10111100101111101011110011000010101111001011111010111100110010001000111010111100100011101011111010001110101111001000111011011101100011101011110010001110110000001000111010111100110111111111101110101010101111001011110011010111 bcbebcc2bcbebcc88ebc8ebe8ebc8edd8ebc8ec08ebcdffbaabcbcd7
EUC-JP シセシツシセシネ湿疾湿爵湿実湿゚諶シシラ 1000111010111100100011101011111010001110101111001000111011000010100011101011110010001110101111101000111010111100100011101100100010111100101111101011110011000000101111001011111010111100110111111011110010111110101111001100001010111100101111101000111011011111100011111101111010110101100011101011110010001110101111001000111011010111 8ebc8ebe8ebc8ec28ebc8ebe8ebc8ec8bcbebcc0bcbebcdfbcbebcc2bcbe8edf8fdeb58ebc8ebc8ed7
UTF-8 シセシツシセシネ湿疾湿爵湿実湿゚諶シシラ 111011111011110110111100111011111011110110111110111011111011110110111100111011111011111010000010111011111011110110111100111011111011110110111110111011111011110110111100111011111011111010001000111001101011100110111111111001111001011010111110111001101011100110111111111001111000100010110101111001101011100110111111111001011010111010011111111001101011100110111111111011111011111010011111111010001010101110110110111011111011110110111100111011111011110110111100111011111011111010010111 efbdbcefbdbeefbdbcefbe82efbdbcefbdbeefbdbcefbe88e6b9bfe796bee6b9bfe788b5e6b9bfe5ae9fe6b9bfefbe9fe8abb6efbdbcefbdbcefbe97
UHC ?????????疾?爵????諶??? 0011111100111111001111110011111100111111001111110011111100111111001111111111001011110000001111111110110111001001001111110011111100111111001111111110010010100110001111110011111100111111 3f3f3f3f3f3f3f3f3ff2f03fedc93f3f3f3fe4a63f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)