To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 頡サ鬘嬋頡サ鬘嫻N}頡サ鬘嬋頡サ鬘嫻N{^ 111010001111010010111011111010011010000110011011011010001110100011110100101110111110100110100001100110110110011001001110011111011110100011110100101110111110100110100001100110110110100011101000111101001011101111101001101000011001101101100110010011100111101101011110 e8f4bbe9a19b68e8f4bbe9a19b664e7de8f4bbe9a19b68e8f4bbe9a19b664e7b5e
EUC-JP 頡サ鬘嬋頡サ鬘嫻N}頡サ鬘嬋頡サ鬘嫻N{^ 11110000111101101000111010111011111100101010001111010101110010011111000011110110100011101011101111110010101000111101010111000111010011100111110111110000111101101000111010111011111100101010001111010101110010011111000011110110100011101011101111110010101000111101010111000111010011100111101101011110 f0f68ebbf2a3d5c9f0f68ebbf2a3d5c74e7df0f68ebbf2a3d5c9f0f68ebbf2a3d5c74e7b5e
UTF-8 頡サ鬘嬋頡サ鬘嫻N}頡サ鬘嬋頡サ鬘嫻N{^ 1110100110100000101000011110111110111101101110111110100110101100100110001110010110101100100010111110100110100000101000011110111110111101101110111110100110101100100110001110010110101011101110110100111001111101111010011010000010100001111011111011110110111011111010011010110010011000111001011010110010001011111010011010000010100001111011111011110110111011111010011010110010011000111001011010101110111011010011100111101101011110 e9a0a1efbdbbe9ac98e5ac8be9a0a1efbdbbe9ac98e5abbb4e7de9a0a1efbdbbe9ac98e5ac8be9a0a1efbdbbe9ac98e5abbb4e7b5e
UHC ???嬋????N}???嬋????N{^ 0011111100111111001111111110000010111101001111110011111100111111001111110100111001111101001111110011111100111111111000001011110100111111001111110011111100111111010011100111101101011110 3f3f3fe0bd3f3f3f3f4e7d3f3f3fe0bd3f3f3f3f4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)