To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遑晢セ鍋明訒サ逎(遑晢セ鍋明訒サ逎(B 111001111010000110011101111011111011111010010011111001111001011010111110111110111010001110111011111001111010001110000001011010011110011110100001100111011110111110111110100100111110011110010110101111101111101110100011101110111110011110100011100000010110100101000010 e7a19defbe93e796befba3bbe7a38169e7a19defbe93e796befba3bbe7a3816942
EUC-JP 遑晢セ鍋明訒サ逎(遑晢セ鍋明訒サ逎(B 111011101010001111011010111100011000111010111110110001101110100111001100110000001000111111011101110010001000111010111011111011101010010110100001110010101110111010100011110110101111000110001110101111101100011011101001110011001100000010001111110111011100100010001110101110111110111010100101101000011100101001000010 eea3daf18ebec6e9ccc08fddc88ebbeea5a1caeea3daf18ebec6e9ccc08fddc88ebbeea5a1ca42
UTF-8 遑晢セ鍋明訒サ逎(遑晢セ鍋明訒サ逎(B 11101001100000011001000111100110100110011010001011101111101111011011111011101001100011011000101111100110100110001000111011101000101010001001001011101111101111011011101111101001100000001000111011101111101111001000100011101001100000011001000111100110100110011010001011101111101111011011111011101001100011011000101111100110100110001000111011101000101010001001001011101111101111011011101111101001100000001000111011101111101111001000100001000010 e98191e699a2efbdbee98d8be6988ee8a892efbdbbe9808eefbc88e98191e699a2efbdbee98d8be6988ee8a892efbdbbe9808eefbc8842
UHC 遑??鍋明???(遑??鍋明???(B 111111001101101000111111001111111100111010100111110110011010010100111111001111110011111110100011101010001111110011011010001111110011111111001110101001111101100110100101001111110011111100111111101000111010100001000010 fcda3f3fcea7d9a53f3f3fa3a8fcda3f3fcea7d9a53f3f3fa3a842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)