To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 螂ェ蜈カ蜊堤ォェ螻槫、哮螂ェ蜈カ蟄倩セソ蟆 111001011010010110101010111001011000010110110110111001011000110110010010111001111010101110101010111001011011000110011110111001011010010010011010010010111110010110100101101010101110010110000101101101101110010110101101100110001110100010111110101111111110010110110000 e5a5aae585b6e58d92e7abaae5b19ee5a49a4be5a5aae585b6e5ad98e8bebfe5b0
EUC-JP 螂ェ蜈カ蜊堤ォェ螻槫、哮螂ェ蜈カ蟄倩セソ蟆 111010101010011110001110101010101110100111100101100011101011011011101001111011011100010011101001100011101010101110001110101010101110101010110011110111001110011110001110101001001101001110101100111010101010011110001110101010101110100111100101100011101011011011101010101011111101000011101010100011101011111010001110101111111110101010110010 eaa78eaae9e58eb6e9edc4e98eab8eaaeab3dce78ea4d3aceaa78eaae9e58eb6eaafd0ea8ebe8ebfeab2
UTF-8 螂ェ蜈カ蜊堤ォェ螻槫、哮螂ェ蜈カ蟄倩セソ蟆 111010001001111010000010111011111011110110101010111010001001110010001000111011111011110110110110111010001001110010001010111001011010000010100100111011111011110110101011111011111011110110101010111010001001111010111011111001101010011110101011111011111011110110100100111001011001001110101110111010001001111010000010111011111011110110101010111010001001110010001000111011111011110110110110111010001001111110000100111001011000000010101001111011111011110110111110111011111011110110111111111010001001111110000110 e89e82efbdaae89c88efbdb6e89c8ae5a0a4efbdabefbdaae89ebbe6a7abefbda4e593aee89e82efbdaae89c88efbdb6e89f84e580a9efbdbeefbdbfe89f86
UHC 螂?蜈??堤?????哮螂?蜈?蟄???? 11010101110011000011111111101000101001010011111100111111111100001010011100111111001111110011111100111111001111111111110011110110110101011100110000111111111010001010010100111111111101101101111000111111001111110011111100111111 d5cc3fe8a53f3ff0a73f3f3f3f3ffcf6d5cc3fe8a53ff6de3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)