To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嚥??僥??絶?????絶??B 1001101010001011001111110011111110011001010001100011111100111111100100001110001000111111001111110011111100111111001111111001000011100010001111110011111101000010 9a8b3f3f99463f3f90e23f3f3f3f3f90e23f3f42
EUC-JP 嚥??僥??絶?????絶??B 1101001111101011001111110011111111010001101001110011111100111111110000001110010000111111001111110011111100111111001111111100000011100100001111110011111101000010 d3eb3f3fd1a73f3fc0e43f3f3f3f3fc0e43f3f42
UTF-8 嚥드츒僥울풘絶쎿영若띺떯絶쏉풘B 11100101100110101010010111101011100100111001110011101100101110001001001011100101100000111010010111101100100110101011100011101101100100101001100011100111101101011011011011101100100011101011111111101100100110001000000111101111101001011011010011101011100111011011101011101011100101101010111111100111101101011011011011101100100011111000100111101101100100101001100001000010 e59aa5eb939cecb892e583a5ec9ab8ed9298e7b5b6ec8ebfec9881efa5b4eb9dbaeb96afe7b5b6ec8f89ed929842
UHC 嚥드츒僥울풘絶쎿영若띺떯絶쏉풘B 11100110101111111011010111100101101011101000110111101000111010011011111111101111101111101001101111101111101111101001101111100110101111111011010111100101101011101000110111101001100010111011111111101111101111101001101111101111101111101001101101000010 e6bfb5e5ae8de8e9bfefbe9befbe9be6bfb5e5ae8de98bbfefbe9befbe9b42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)