What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?紆??紗?寃??拒淨?紆??紗?寃??居^ 100111111100010000111111111000101111110000111111001111111000111011010001001111111001101110000011001111110011111110001011100100011001111111000100001111111110001011111100001111110011111110001110110100010011111110011011100000110011111100111111100010111000111101011110 9fc43fe2fc3f3f8ed13f9b833f3f8b919fc43fe2fc3f3f8ed13f9b833f3f8b8f5e
EUC-JP 淨?紆?焌紗?寃??拒淨?紆?焌紗?寃??居^ 11011110110001100011111111100100111111100011111110001111110010011110100010111100110100110011111111010101111000110011111100111111101101011111000111011110110001100011111111100100111111100011111110001111110010011110100010111100110100110011111111010101111000110011111100111111101101011110111101011110 dec63fe4fe3f8fc9e8bcd33fd5e33f3fb5f1dec63fe4fe3f8fc9e8bcd33fd5e33f3fb5ef5e
UTF-8 淨렠紆렣焌紗떵寃닿렋拒淨렠紆렣焌紗떵寃닿렋居^ 11100110101101111010100011101011101000001010000011100111101101001000011011101011101000001010001111100111100001001000110011100111101101001001011111101011100101101011010111100101101011111000001111101011100010111011111111101011101000001000101111100110100010111001001011100110101101111010100011101011101000001010000011100111101101001000011011101011101000001010001111100111100001001000110011100111101101001001011111101011100101101011010111100101101011111000001111101011100010111011111111101011101000001000101111100101101100011000010101011110 e6b7a8eba0a0e7b486eba0a3e7848ce7b497eb96b5e5af83eb8bbfeba08be68b92e6b7a8eba0a0e7b486eba0a3e7848ce7b497eb96b5e5af83eb8bbfeba08be5b1855e
UHC 淨렠紆렣焌紗떵寃닿렋拒淨렠紆렣焌紗떵寃닿렋居^ 111011111110010010001110101100011110100111100001100011101011010011110001111000001101111011101001101101101011101011101010101100101011010011101010100011101010001011001011110111101110111111100100100011101011000111101001111000011000111010110100111100011110000011011110111010011011011010111010111010101011001010110100111010101000111010100010110010111101110001011110 efe48eb1e9e18eb4f1e0dee9b6baeab2b4ea8ea2cbdeefe48eb1e9e18eb4f1e0dee9b6baeab2b4ea8ea2cbdc5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
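
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping from the page's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the ISO-8859-1 row, although other substitutions may differ from the page's.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# The choice of cp932 for SJIS-WIN and cp949 for UHC is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters are replaced with "?" instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "^"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII character "^", every charset listed here yields the single byte `5e` (`01011110`), which is why each row of the table ends with those values.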