To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 蠍ク鬆”蠍ク鬆’N}蠍ク鬆”蠍ク鬆’N{^ 111001011011011010111000111010011010000010000001011010001110010110110110101110001110100110100000100000010110011001001110011111011110010110110110101110001110100110100000100000010110100011100101101101101011100011101001101000001000000101100110010011100111101101011110 e5b6b8e9a08168e5b6b8e9a081664e7de5b6b8e9a08168e5b6b8e9a081664e7b5e
EUC-JP 蠍ク鬆”蠍ク鬆’N}蠍ク鬆”蠍ク鬆’N{^ 11101010101110001000111010111000111100101010001010100001110010011110101010111000100011101011100011110010101000101010000111000111010011100111110111101010101110001000111010111000111100101010001010100001110010011110101010111000100011101011100011110010101000101010000111000111010011100111101101011110 eab88eb8f2a2a1c9eab88eb8f2a2a1c74e7deab88eb8f2a2a1c9eab88eb8f2a2a1c74e7b5e
UTF-8 蠍ク鬆”蠍ク鬆’N}蠍ク鬆”蠍ク鬆’N{^ 1110100010100000100011011110111110111101101110001110100110101100100001101110001010000000100111011110100010100000100011011110111110111101101110001110100110101100100001101110001010000000100110010100111001111101111010001010000010001101111011111011110110111000111010011010110010000110111000101000000010011101111010001010000010001101111011111011110110111000111010011010110010000110111000101000000010011001010011100111101101011110 e8a08defbdb8e9ac86e2809de8a08defbdb8e9ac86e280994e7de8a08defbdb8e9ac86e2809de8a08defbdb8e9ac86e280994e7b5e
UHC ???”???’N}???”???’N{^ 00111111001111110011111110100001101100010011111100111111001111111010000110101111010011100111110100111111001111110011111110100001101100010011111100111111001111111010000110101111010011100111101101011110 3f3f3fa1b13f3f3fa1af4e7d3f3f3fa1b13f3f3fa1af4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)