What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??踰→????艶l????純??哀??? 1001100001100000001111110011111111100110111110101000000110101000001111110011111100111111001111111000100110010000100000101000110000111111001111110011111100111111100011111000001100111111001111111000100010100011001111110011111100111111 98603f3fe6fa81a83f3f3f3f8990828c3f3f3f3f8f833f3f88a33f3f3f
EUC-JP 倭??踰→????艶l????純??哀??? 1100111111000001001111110011111111101100111111001010001010101010001111110011111100111111001111111011000111110000101000111110110000111111001111110011111100111111101111011110001100111111001111111011000010100101001111110011111100111111 cfc13f3fecfca2aa3f3f3f3fb1f0a3ec3f3f3f3fbde33f3fb0a53f3f3f
UTF-8 倭녾낮踰→룚硫⑹묾艶l꼷留뗥윜純볩폍哀잙씤柳 111001011000000010101101111010111000010110111110111010111000001010101110111010001011100010110000111000101000011010010010111010111010001110011010111011111010011110001110111000101001000110111001111010111010110010111110111010001000100110110110111011111011110110001100111010101011110010110111111011111010011110001101111010111001011110100101111011001001110010011100111001111011010010010100111010111011001110101001111011011000111110001101111001011001001110000000111011001001111010011001111011001001010010100100111011111010011110001001 e580adeb85beeb82aee8b8b0e28692eba39aefa78ee291b9ebacbee889b6efbd8ceabcb7efa78deb97a5ec9c9ce7b494ebb3a9ed8f8de59380ec9e99ec94a4efa789
UHC 倭녾낮踰→룚硫⑹묾艶l꼷留뗥윜純볩폍哀잙씤柳 1110100011011110100001101110101010110011101101111110101110110010101000011110011010001111100101101110101110101001101010011110110010111001101100101110011011111101101000111110110010000100100011111110101110100111100010111110010110011111100111111110001011101101100100111110111110111100100110001110010011101110100111111110101110011101101110001110101011110111 e8de86eab3b7ebb2a1e68f96eba9a9ecb9b2e6fda3ec848feba78be59f9fe2ed93efbc98e4ee9feb9db8eaf7

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
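
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the names CODECS and encode_all are made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what produces the runs of 3f in the table.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (bits, raw.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hexed) in encode_all("倭").items():
        print(label, bits, hexed)
```

For the single character 倭, this sketch yields e580ad for UTF-8, 9860 for SJIS-WIN, cfc1 for EUC-JP, and 3f for ISO-8859-1, matching the leading bytes of the corresponding rows in the table.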