To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 訝?????腋??歪や?節??嚥??椰??B 1110011001100010001111110011111100111111001111110011111111100011111111000011111100111111100110000110001110000010111000100011111110010000110111110011111100111111100110101000101100111111001111111001111010111101001111110011111101000010 e6623f3f3f3f3fe3fc3f3f986382e23f90df3f3f9a8b3f3f9ebd3f3f42
EUC-JP 訝??邕??腋??歪や?節??嚥??椰??B 11101011110000110011111100111111100011111110000111101101001111110011111111100110111111100011111100111111110011111100010010100100111001000011111111000000111000010011111100111111110100111110101100111111001111111101110010111111001111110011111101000010 ebc33f3f8fe1ed3f3fe6fe3f3fcfc4a4e43fc0e13f3fd3eb3f3fdcbf3f3f42
UTF-8 訝딉슴邕딀영腋룟쳣歪や툓節욤퍓嚥드윿椰됮궕B 11101000101010001001110111101011100101001000100111101100100010101011010011101001100000101001010111101011100101001000000011101100100110001000000111101000100001011000101111101011101000111001111111101100101100111010001111100110101011011010101011100011100000101000010011101101100010001001001111100111101011111000000011101100100110101010010011101101100011011001001111100101100110101010010111101011100100111001110011101100100111001011111111100110101001001011000011101011100100001010111011101010101101101001010101000010 e8a89deb9489ec8ab4e98295eb9480ec9881e8858beba39fecb3a3e6adaae38284ed8893e7af80ec9aa4ed8d93e59aa5eb939cec9cbfe6a4b0eb90aeeab69542
UHC 訝딉슴邕딀영腋룟쳣歪や툓節욤퍓嚥드윿椰됮궕B 11100100101110001000101011101111101111011011111111101000101110111000101011100110101111111011010111100100111111011011011111100101101010111000100111101000111000001010101011100100101110001000101011101111101111011011111111101000101110111000101011100110101111111011010111100101100111111011011111100101101010111000100111101001100000101010101001000010 e4b88aefbdbfe8bb8ae6bfb5e4fdb7e5ab89e8e0aae4b88aefbdbfe8bb8ae6bfb5e59fb7e5ab89e982aa42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)