To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 蟹??寤?R蟹??寤?^[蟹??寤?R蟹??寤?^[^ 1000101001001001001111110011111110011011100010000011111101010010100010100100100100111111001111111001101110001000001111110101111001011011100010100100100100111111001111111001101110001000001111110101001010001010010010010011111100111111100110111000100000111111010111100101101101011110 8a493f3f9b883f528a493f3f9b883f5e5b8a493f3f9b883f528a493f3f9b883f5e5b5e
EUC-JP 蟹??寤?R蟹??寤?^[蟹??寤?R蟹??寤?^[^ 1011001110101010001111110011111111010101111010000011111101010010101100111010101000111111001111111101010111101000001111110101111001011011101100111010101000111111001111111101010111101000001111110101001010110011101010100011111100111111110101011110100000111111010111100101101101011110 b3aa3f3fd5e83f52b3aa3f3fd5e83f5e5bb3aa3f3fd5e83f52b3aa3f3fd5e83f5e5b5e
UTF-8 蟹딉풄寤쓓R蟹딉풄寤쓓^[蟹딉풄寤쓓R蟹딉풄寤쓓^[^ 11101000100111111011100111101011100101001000100111101101100100101000010011100101101011111010010011101100100100111001001101010010111010001001111110111001111010111001010010001001111011011001001010000100111001011010111110100100111011001001001110010011010111100101101111101000100111111011100111101011100101001000100111101101100100101000010011100101101011111010010011101100100100111001001101010010111010001001111110111001111010111001010010001001111011011001001010000100111001011010111110100100111011001001001110010011010111100101101101011110 e89fb9eb9489ed9284e5afa4ec939352e89fb9eb9489ed9284e5afa4ec93935e5be89fb9eb9489ed9284e5afa4ec939352e89fb9eb9489ed9284e5afa4ec93935e5b5e
UHC 蟹딉풄寤쓓R蟹딉풄寤쓓^[蟹딉풄寤쓓R蟹딉풄寤쓓^[^ 1111101010101111100010101110111110111110100011001110011111110101100111010110111001010010111110101010111110001010111011111011111010001100111001111111010110011101011011100101111001011011111110101010111110001010111011111011111010001100111001111111010110011101011011100101001011111010101011111000101011101111101111101000110011100111111101011001110101101110010111100101101101011110 faaf8aefbe8ce7f59d6e52faaf8aefbe8ce7f59d6e5e5bfaaf8aefbe8ce7f59d6e52faaf8aefbe8ce7f59d6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)