To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌?????違??鶯щ????邑??筌??鍮 1110001010100011001111110011111100111111001111110011111110001000111000010011111100111111111010011111001010000100100010110011111100111111001111110011111110010111010101110011111100111111111000101010001100111111001111111110100001001010 e2a33f3f3f3f3f88e13f3fe9f2848b3f3f3f3f97573f3fe2a33f3fe84a
EUC-JP 筌?????違??鶯щ?靷??邑??筌??鍮 11100100101001010011111100111111001111110011111100111111101100001110001100111111001111111111001011110100101001111110101100111111100011111110011110111101001111110011111111001101101110000011111100111111111001001010010100111111001111111110111110101011 e4a53f3f3f3f3fb0e33f3ff2f4a7eb3f8fe7bd3f3fcdb83f3fe4a53f3fefab
UTF-8 筌뗫끂栒덅뤃違꾨졁鶯щ벩靷딉쭓邑뀁죧筌뚮벊鍮 1110011110101101100011001110101110010111101010111110101110000001100000101110011010100000100100101110101110001101100001011110101110100100100000111110100110000001100101011110101010111110101010001110110010100001100000011110100110110110101011111101000110001001111010111011001010101001111010011001110110110111111010111001010010001001111011001010110110010011111010011000001010010001111010111000000010000001111011001010001110100111111001111010110110001100111010111001101010101110111010111011001010001010111010011000110110101110 e7ad8ceb97abeb8182e6a092eb8d85eba483e98195eabea8eca181e9b6afd189ebb2a9e99db7eb9489ecad93e98291eb8081eca3a7e7ad8ceb9aaeebb28ae98dae
UHC 筌뗫끂栒덅뤃違꾨졁鶯щ벩靷딉쭓邑뀁죧筌뚮벊鍮 1110111110100111100010111110101110000101101110001110001011100011100010001110100010001111101101001110101011011110100001001110101110100000101100101110010110100011101011001110101110010011101111111110110011100110100010101110111110100111100010111110101111101001101100101110110010100001100000101110111110100111100011001110101110010011101011011110101110111001 efa78beb85b8e2e388e88fb4eade84eba0b2e5a3aceb93bfece68aefa78bebe9b2eca182efa78ceb93adebb9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)