To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???曖?????釗??釗??釗??釗??^ 001111110011111100111111100111100100001000111111001111110011111100111111001111111111101110111011001111110011111111111011101110110011111100111111111110111011101100111111001111111111101110111011001111110011111101011110 3f3f3f9e423f3f3f3f3ffbbb3f3ffbbb3f3ffbbb3f3ffbbb3f3f5e
EUC-JP ???曖?????釗??釗??釗??釗??^ 00111111001111110011111111011011101000110011111100111111001111110011111100111111100011111110001110100110001111110011111110001111111000111010011000111111001111111000111111100011101001100011111100111111100011111110001110100110001111110011111101011110 3f3f3fdba33f3f3f3f3f8fe3a63f3f8fe3a63f3f8fe3a63f3f8fe3a63f3f5e
UTF-8 溜삘뵗曖쒌뀛溜띾졎釗숇ℓ釗숈뻐釗숁뼡釗숇젷^ 11101111101001111000101111101100100000101001100011101011101101011001011111100110100110111001011011101100100100101000110011101011100000001001101111101111101001111000101111101011100111011011111011101100101000011000111011101001100001111001011111101100100010001000011111100010100001001001001111101001100001111001011111101100100010001000100011101011101110111001000011101001100001111001011111101100100010001000000111101011101111001010000111101001100001111001011111101100100010001000011111101100101000001011011101011110 efa78bec8298ebb597e69b96ec928ceb809befa78beb9dbeeca18ee98797ec8887e28493e98797ec8888ebbb90e98797ec8881ebbca1e98797ec8887eca0b75e
UHC 溜삘뵗曖쒌뀛溜띾졎釗숇ℓ釗숈뻐釗숁뼡釗숇젷^ 11101010111111101011101111100010100101001001100111100100111100101001110011100011100001011001010011101010111111101000110111101011101000001011101111100001111100101001100111101011101001111010010011100001111100101001100111101100101110111011010111100001111100101001100111100110100101101010010011100001111100101001100111101011101000001010101101011110 eafebbe29499e4f29ce38594eafe8deba0bbe1f299eba7a4e1f299ecbbb5e1f299e696a4e1f299eba0ab5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
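
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `bit_strings` are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table shows.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return (label, binary, hex) for text encoded in each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Eight binary digits per byte, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


for label, binary, hexadecimal in bit_strings("曖^"):
    print(label, binary, hexadecimal)
```

For the input `曖^`, the UTF-8 row should read `e69b965e` in hexadecimal, and the ISO-8859-1 row `3f5e`, since `曖` has no ISO-8859-1 representation.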