To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 髢サ鬘嬋髢サ鬘嫻N}髢サ鬘嬋髢サ鬘嫻N{^ 111010011001011010111011111010011010000110011011011010001110100110010110101110111110100110100001100110110110011001001110011111011110100110010110101110111110100110100001100110110110100011101001100101101011101111101001101000011001101101100110010011100111101101011110 e996bbe9a19b68e996bbe9a19b664e7de996bbe9a19b68e996bbe9a19b664e7b5e
EUC-JP 髢サ鬘嬋髢サ鬘嫻N}髢サ鬘嬋髢サ鬘嫻N{^ 11110001111101101000111010111011111100101010001111010101110010011111000111110110100011101011101111110010101000111101010111000111010011100111110111110001111101101000111010111011111100101010001111010101110010011111000111110110100011101011101111110010101000111101010111000111010011100111101101011110 f1f68ebbf2a3d5c9f1f68ebbf2a3d5c74e7df1f68ebbf2a3d5c9f1f68ebbf2a3d5c74e7b5e
UTF-8 髢サ鬘嬋髢サ鬘嫻N}髢サ鬘嬋髢サ鬘嫻N{^ 1110100110101011101000101110111110111101101110111110100110101100100110001110010110101100100010111110100110101011101000101110111110111101101110111110100110101100100110001110010110101011101110110100111001111101111010011010101110100010111011111011110110111011111010011010110010011000111001011010110010001011111010011010101110100010111011111011110110111011111010011010110010011000111001011010101110111011010011100111101101011110 e9aba2efbdbbe9ac98e5ac8be9aba2efbdbbe9ac98e5abbb4e7de9aba2efbdbbe9ac98e5ac8be9aba2efbdbbe9ac98e5abbb4e7b5e
UHC ???嬋????N}???嬋????N{^ 0011111100111111001111111110000010111101001111110011111100111111001111110100111001111101001111110011111100111111111000001011110100111111001111110011111100111111010011100111101101011110 3f3f3fe0bd3f3f3f3f4e7d3f3f3fe0bd3f3f3f3f4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)