What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 鉐ャ閧敬鉐ャ閧掲N}鉐ャ閧敬鉐ャ閧掲N{^ 111001111110111110101100111010001000001010001100011010001110011111101111101011001110100010000010100011000110011001001110011111011110011111101111101011001110100010000010100011000110100011100111111011111010110011101000100000101000110001100110010011100111101101011110 e7eface8828c68e7eface8828c664e7de7eface8828c68e7eface8828c664e7b5e
EUC-JP 鉐ャ閧敬鉐ャ閧掲N}鉐ャ閧敬鉐ャ閧掲N{^ 11101110111100011000111010101100111011111110001010110111110010011110111011110001100011101010110011101111111000101011011111000111010011100111110111101110111100011000111010101100111011111110001010110111110010011110111011110001100011101010110011101111111000101011011111000111010011100111101101011110 eef18eacefe2b7c9eef18eacefe2b7c74e7deef18eacefe2b7c9eef18eacefe2b7c74e7b5e
UTF-8 鉐ャ閧敬鉐ャ閧掲N}鉐ャ閧敬鉐ャ閧掲N{^ 1110100110001001100100001110111110111101101011001110100110010110101001111110011010010101101011001110100110001001100100001110111110111101101011001110100110010110101001111110011010001110101100100100111001111101111010011000100110010000111011111011110110101100111010011001011010100111111001101001010110101100111010011000100110010000111011111011110110101100111010011001011010100111111001101000111010110010010011100111101101011110 e98990efbdace996a7e695ace98990efbdace996a7e68eb24e7de98990efbdace996a7e695ace98990efbdace996a7e68eb24e7b5e
UHC ???敬????N}???敬????N{^ 0011111100111111001111111100110011010111001111110011111100111111001111110100111001111101001111110011111100111111110011001101011100111111001111110011111100111111010011100111101101011110 3f3f3fccd73f3f3f3f4e7d3f3f3fccd73f3f3f3f4e7b5e

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
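
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents, and the helper names `encode_row` and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 and UHC rows above. Individual mappings may differ slightly from what this page produces.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("N{^").items():
        print(label, bits, hexstr)
```

Because "N{^" is plain ASCII, all five charsets yield the same bytes for it, which is why every row in the table ends in `4e7b5e`.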