To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????[???????????[^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 張??野э?椰????[張??野э?椰????[^ 100100101010001100111111001111111001011011101100100001001000111100111111100111101011110100111111001111110011111100111111010110111001001010100011001111110011111110010110111011001000010010001111001111111001111010111101001111110011111100111111001111110101101101011110 92a33f3f96ec848f3f9ebd3f3f3f3f5b92a33f3f96ec848f3f9ebd3f3f3f3f5b5e
EUC-JP 張??野э?椰????[張??野э?椰????[^ 110001001010010100111111001111111100110011101110101001111110111100111111110111001011111100111111001111110011111100111111010110111100010010100101001111110011111111001100111011101010011111101111001111111101110010111111001111110011111100111111001111110101101101011110 c4a53f3fcceea7ef3fdcbf3f3f3f3f5bc4a53f3fcceea7ef3fdcbf3f3f3f3f5b5e
UTF-8 張욑푶野э숴椰됭짌若쨝[張욑푶野э숴椰됭짌若쨝[^ 11100101101111001011010111101100100110101001000111101101100100011011011011101001100001111000111011010001100011011110110010001000101101001110011010100100101100001110101110010000101011011110110010100111100011001110111110100101101101001110110010101000100111010101101111100101101111001011010111101100100110101001000111101101100100011011011011101001100001111000111011010001100011011110110010001000101101001110011010100100101100001110101110010000101011011110110010100111100011001110111110100101101101001110110010101000100111010101101101011110 e5bcb5ec9a91ed91b6e9878ed18dec88b4e6a4b0eb90adeca78cefa5b4eca89d5be5bcb5ec9a91ed91b6e9878ed18dec88b4e6a4b0eb90adeca78cefa5b4eca89d5b5e
UHC 張욑푶野э숴椰됭짌若쨝[張욑푶野э숴椰됭짌若쨝[^ 1110110111100101100111101110111110111110100001001110010110101111101011001110111110111101101001001110010110101011100010011110100010100011100110001110010110101110101001000111001001011011111011011110010110011110111011111011111010000100111001011010111110101100111011111011110110100100111001011010101110001001111010001010001110011000111001011010111010100100011100100101101101011110 ede59eefbe84e5afacefbda4e5ab89e8a398e5aea4725bede59eefbe84e5afacefbda4e5ab89e8a398e5aea4725b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)