To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 逶也怱菫セ逶也怱逍シ逶也怱菫セ逶也怱逍シB 11100111100110111001011011100111100111001000010011100100101111111011111011100111100110111001011011100111100111001000010011100111100101101011110011100111100110111001011011100111100111001000010011100100101111111011111011100111100110111001011011100111100111001000010011100111100101101011110001000010 e79b96e79c84e4bfbee79b96e79c84e796bce79b96e79c84e4bfbee79b96e79c84e796bc42
EUC-JP 逶也怱菫セ逶也怱逍シ逶也怱菫セ逶也怱逍シB 1110110111111011110011001110100111010111111001001110100011000001100011101011111011101101111110111100110011101001110101111110010011101101111101101000111010111100111011011111101111001100111010011101011111100100111010001100000110001110101111101110110111111011110011001110100111010111111001001110110111110110100011101011110001000010 edfbcce9d7e4e8c18ebeedfbcce9d7e4edf68ebcedfbcce9d7e4e8c18ebeedfbcce9d7e4edf68ebc42
UTF-8 逶也怱菫セ逶也怱逍シ逶也怱菫セ逶也怱逍シB 11101001100000001011011011100100101110011001111111100110100000001011000111101000100011111010101111101111101111011011111011101001100000001011011011100100101110011001111111100110100000001011000111101001100000001000110111101111101111011011110011101001100000001011011011100100101110011001111111100110100000001011000111101000100011111010101111101111101111011011111011101001100000001011011011100100101110011001111111100110100000001011000111101001100000001000110111101111101111011011110001000010 e980b6e4b99fe680b1e88fabefbdbee980b6e4b99fe680b1e9808defbdbce980b6e4b99fe680b1e88fabefbdbee980b6e4b99fe680b1e9808defbdbc42
UHC ?也?菫??也?逍??也?菫??也?逍?B 0011111111100101101001010011111111010000110010110011111100111111111001011010010100111111111000011100111000111111001111111110010110100101001111111101000011001011001111110011111111100101101001010011111111100001110011100011111101000010 3fe5a53fd0cb3f3fe5a53fe1ce3f3fe5a53fd0cb3f3fe5a53fe1ce3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)