To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 晤??潁??亦??鈺??節??節η????傲 100111011110101100111111001111111001111111110001001111110011111110010110100100100011111100111111111110111100010000111111001111111001000011011111001111110011111110010000110111111000001111000101001111110011111100111111001111111001100011111100 9deb3f3f9ff13f3f96923f3ffbc43f3f90df3f3f90df83c53f3f3f3f98fc
EUC-JP 晤??潁??亦??鈺??節??節η????傲 11011010111011010011111100111111110111101111001100111111001111111100101111110010001111110011111110001111111000111101010100111111001111111100000011100001001111110011111111000000111000011010011011000111001111110011111100111111001111111101000011111110 daed3f3fdef33f3fcbf23f3f8fe3d53f3fc0e13f3fc0e1a6c73f3f3f3fd0fe
UTF-8 晤댐쉈潁꿰빓亦껇렆鈺뤄스節배씠節η림閱룡쾬傲 1110011010011001101001001110101110001100100100001110110010001001100010001110011010111101100000011110101010111111101100001110101110111001100100111110010010111010101001101110101010111011100001111110101110100000100001101110100110001000101110101110101110100100100001001110110010001010101001001110011110101111100000001110101110110000101100001110110010010100101000001110011110101111100000001100111010110111111010111010011010111100111010011001011010110001111010111010001110100001111011001011111010101100111001011000001010110010 e699a4eb8c90ec8988e6bd81eabfb0ebb993e4baa6eabb87eba086e988baeba484ec8aa4e7af80ebb0b0ec94a0e7af80ceb7eba6bce996b1eba3a1ecbeace582b2
UHC 晤댐쉈潁꿰빓亦껇렆鈺뤄스節배씠節η림閱룡쾬傲 1110011111111011101101001110111110111101101001011110011110111000101100101110011110010101101101111110011010110010100000111110100010001110101000001110100010101101101101111110111110111101101110101110111110111101101110011110100010011101101101001110111110111101101001011110011110111000101100101110011011110011101101111110011010110010100000111110011111101100 e7fbb4efbda5e7b8b2e795b7e6b283e88ea0e8adb7efbdbaefbdb9e89db4efbda5e7b8b2e6f3b7e6b283e7ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)