To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 臧???臧???N}臧???臧???N{^ 11100100011010000011111100111111001111111110010001101000001111110011111100111111010011100111110111100100011010000011111100111111001111111110010001101000001111110011111100111111010011100111101101011110 e4683f3f3fe4683f3f3f4e7de4683f3f3fe4683f3f3f4e7b5e
EUC-JP 臧?蔣?臧?蔣?N}臧?蔣?臧?蔣?N{^ 111001111100100100111111100011111101100110110110001111111110011111001001001111111000111111011001101101100011111101001110011111011110011111001001001111111000111111011001101101100011111111100111110010010011111110001111110110011011011000111111010011100111101101011110 e7c93f8fd9b63fe7c93f8fd9b63f4e7de7c93f8fd9b63fe7c93f8fd9b63f4e7b5e
UTF-8 臧렧蔣궜臧렧蔣권N}臧렧蔣궜臧렧蔣권N{^ 1110100010000111101001111110101110100000101001111110100010010100101000111110101010110110100111001110100010000111101001111110101110100000101001111110100010010100101000111110101010110110100011000100111001111101111010001000011110100111111010111010000010100111111010001001010010100011111010101011011010011100111010001000011110100111111010111010000010100111111010001001010010100011111010101011011010001100010011100111101101011110 e887a7eba0a7e894a3eab69ce887a7eba0a7e894a3eab68c4e7de887a7eba0a7e894a3eab69ce887a7eba0a7e894a3eab68c4e7b5e
UHC 臧렧蔣궜臧렧蔣권N}臧렧蔣궜臧렧蔣권N{^ 11101101111101011000111010110110111011011111100010110001110010011110110111110101100011101011011011101101111110001011000111000111010011100111110111101101111101011000111010110110111011011111100010110001110010011110110111110101100011101011011011101101111110001011000111000111010011100111101101011110 edf58eb6edf8b1c9edf58eb6edf8b1c74e7dedf58eb6edf8b1c9edf58eb6edf8b1c74e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)