To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8諭??汲???碎⑦?暗??倚?? 111000011001111100111111100000100101011110010111010000000011111100111111100010111000001000111111001111110011111111100001111010101000011101000110001111111000100011000011001111110011111110011000110111110011111100111111 e19f3f825797403f3f8b823f3f3fe1ea87463f88c33f3f98df3f3f
EUC-JP 癲?8諭??汲庾??碎??暗??倚?? 11100010101000010011111110100011101110001100110110100001001111110011111110110101111000101000111110111100110011100011111100111111111000101110110000111111001111111011000011000101001111110011111111010000111000010011111100111111 e2a13fa3b8cda13f3fb5e28fbcce3f3fe2ec3f3fb0c53f3fd0e13f3f
UTF-8 癲쒕8諭띈맱汲庾곮쥈碎⑦룍暗삳봾倚묈쳞 111001111001100110110010111011001001001010010101111011111011110010011000111010001010101110101101111010111001110110001000111010111010011110110001111001101011000110110010111001011011101010111110111010101011001110101110111011001010010110001000111001111010001010001110111000101001000110100110111010111010001110001101111001101001101010010111111011001000001010110011111010111011010010111110111001011000000010011010111010111010110010001000111011001011001110011110 e799b2ec9295efbc98e8abadeb9d88eba7b1e6b1b2e5babeeab3aeeca588e7a28ee291a6eba38de69a97ec82b3ebb4bee5809aebac88ecb39e
UHC 癲쒕8諭띈맱汲庾곮쥈碎⑦룍暗삳봾倚묈쳞 1110111110100110100111001110101110100011101110001110101110110001101101101110100010010000101110001101000011100011111010101110110010000001111010001010001010000001111000011110111110101000111011011000111110001011111001001101111010111011111010111001010010000101111010111110111110010001111001011010101110000100 efa69ceba3b8ebb1b6e890b8d0e3eaec81e8a281e1efa8ed8f8be4debbeb9485ebef91e5ab84

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
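
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names are assumptions: Python's cp932 stands in for SJIS-WIN and cp949 for UHC, and their mapping tables may differ in detail from the ones this tool uses. The names CHARSETS and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code = CP949
}


def encode_table(text):
    """Return (charset, displayed text, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hex_string in encode_table("あ"):
        print(label, shown, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.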