To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 址?諍???畯?n}址?諍???畯?n{^ 100110101010110000111111111001100111100100111111001111110011111111111011011011110011111101101110011111011001101010101100001111111110011001111001001111110011111100111111111110110110111100111111011011100111101101011110 9aac3fe6793f3f3ffb6f3f6e7d9aac3fe6793f3f3ffb6f3f6e7b5e
EUC-JP 址?諍???畯?n}址?諍???畯?n{^ 1101010010101110001111111110101111011010001111110011111100111111100011111100110110111011001111110110111001111101110101001010111000111111111010111101101000111111001111110011111110001111110011011011101100111111011011100111101101011110 d4ae3febda3f3f3f8fcdbb3f6e7dd4ae3febda3f3f3f8fcdbb3f6e7b5e
UTF-8 址렞諍쇰렊렞畯렩n}址렞諍쇰렊렞畯렩n{^ 1110010110011101100000001110101110100000100111101110100010101011100011011110110010000111101100001110101110100000100010101110101110100000100111101110011110010101101011111110101110100000101010010110111001111101111001011001110110000000111010111010000010011110111010001010101110001101111011001000011110110000111010111010000010001010111010111010000010011110111001111001010110101111111010111010000010101001011011100111101101011110 e59d80eba09ee8ab8dec87b0eba08aeba09ee795afeba0a96e7de59d80eba09ee8ab8dec87b0eba08aeba09ee795afeba0a96e7b5e
UHC 址렞諍쇰렊렞畯렩n}址렞諍쇰렊렞畯렩n{^ 11110010101000111000111010101111111011101011010110111100111010111000111010100001100011101010111111110001111000011000111010110111011011100111110111110010101000111000111010101111111011101011010110111100111010111000111010100001100011101010111111110001111000011000111010110111011011100111101101011110 f2a38eafeeb5bceb8ea18eaff1e18eb76e7df2a38eafeeb5bceb8ea18eaff1e18eb76e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)