To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????遙??節??徇??節?ゴ央??? 00111111001111110011111100111111001111110011111111101010101000010011111100111111100100001101111100111111001111111001110001101101001111110011111110010000110111110011111110000011010100111000100110011011001111110011111100111111 3f3f3f3f3f3feaa13f3f90df3f3f9c6d3f3f90df3f8353899b3f3f3f
EUC-JP ???縕??遙??節??徇??節?ゴ央??? 001111110011111100111111100011111101010011000010001111110011111111110100101000110011111100111111110000001110000100111111001111111101011111001110001111110011111111000000111000010011111110100101101101001011000111111011001111110011111100111111 3f3f3f8fd4c23f3ff4a33f3fc0e13f3fd7ce3f3fc0e13fa5b4b1fb3f3f3f
UTF-8 娛곤슉縕됵슴遙닷돭節얏㉥徇쀨쾳節욥ゴ央곤슭呂 111001011010100010011011111010101011001110100100111011001000101010001001111001111011100010010101111010111001000010110101111011001000101010110100111010011000000110011001111010111000101110110111111010111000111110101101111001111010111110000000111011001001011010001111111000111000100110100101111001011011111010000111111011001000000010101000111011001011111010110011111001111010111110000000111011001001101010100101111000111000001010110100111001011010010010101110111010101011001110100100111011001000101010101101111011111010011010000000 e5a89beab3a4ec8a89e7b895eb90b5ec8ab4e98199eb8bb7eb8fade7af80ec968fe389a5e5be87ec80a8ecbeb3e7af80ec9aa5e382b4e5a4aeeab3a4ec8aadefa680
UHC 娛곤슉縕됵슴遙닷돭節얏㉥徇쀨쾳節욥ゴ央곤슭呂 1110011111110100101100001110111110111101101101011110100010110010100010011110111110111101101111111110100110101011101101001110010110001001101100001110111110111101101111101110011010101000101101101110001011011111100101111110100010110010100010011110111110111101101111111110100110101011101101001110010011100111101100001110111110111101101111101110010111111011 e7f4b0efbdb5e8b289efbdbfe9abb4e589b0efbdbee6a8b6e2df97e8b289efbdbfe9abb4e4e7b0efbdbee5fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)