To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 懿??擬??酉??N}懿??擬??酉??N{^ 1001110011110010001111110011111110001011010110110011111100111111100100111101000100111111001111110100111001111101100111001111001000111111001111111000101101011011001111110011111110010011110100010011111100111111010011100111101101011110 9cf23f3f8b5b3f3f93d13f3f4e7d9cf23f3f8b5b3f3f93d13f3f4e7b5e
EUC-JP 懿??擬??酉??N}懿??擬??酉??N{^ 1101100011110100001111110011111110110101101111000011111100111111110001101101001100111111001111110100111001111101110110001111010000111111001111111011010110111100001111110011111111000110110100110011111100111111010011100111101101011110 d8f43f3fb5bc3f3fc6d33f3f4e7dd8f43f3fb5bc3f3fc6d33f3f4e7b5e
UTF-8 懿뚦뀯擬뉖쨸酉계쓼N}懿뚦뀯擬뉖쨸酉계쓼N{^ 1110011010000111101111111110101110011010101001101110101110000000101011111110011010010011101011001110101110001001100101101110110010101000101110001110100110000101100010011110101010110011100001001110110010010011101111000100111001111101111001101000011110111111111010111001101010100110111010111000000010101111111001101001001110101100111010111000100110010110111011001010100010111000111010011000010110001001111010101011001110000100111011001001001110111100010011100111101101011110 e687bfeb9aa6eb80afe693aceb8996eca8b8e98589eab384ec93bc4e7de687bfeb9aa6eb80afe693aceb8996eca8b8e98589eab384ec93bc4e7b5e
UHC 懿뚦뀯擬뉖쨸酉계쓼N}懿뚦뀯擬뉖쨸酉계쓼N{^ 1110101111110011100011001110010110000101101001011110101111110100100001111110101110100100100100101110101110110111101100001110100010011101100101110100111001111101111010111111001110001100111001011000010110100101111010111111010010000111111010111010010010010010111010111011011110110000111010001001110110010111010011100111101101011110 ebf38ce585a5ebf487eba492ebb7b0e89d974e7debf38ce585a5ebf487eba492ebb7b0e89d974e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)