What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????^h????^fN}????^h????^fN{^ 0011111100111111001111110011111101011110011010000011111100111111001111110011111101011110011001100100111001111101001111110011111100111111001111110101111001101000001111110011111100111111001111110101111001100110010011100111101101011110 3f3f3f3f5e683f3f3f3f5e664e7d3f3f3f3f5e683f3f3f3f5e664e7b5e
SJIS-WIN 張貊??^h張貊??^fN}張貊??^h張貊??^fN{^ 10010010101000111110011010111011001111110011111101011110011010001001001010100011111001101011101100111111001111110101111001100110010011100111110110010010101000111110011010111011001111110011111101011110011010001001001010100011111001101011101100111111001111110101111001100110010011100111101101011110 92a3e6bb3f3f5e6892a3e6bb3f3f5e664e7d92a3e6bb3f3f5e6892a3e6bb3f3f5e664e7b5e
EUC-JP 張貊??^h張貊??^fN}張貊??^h張貊??^fN{^ 11000100101001011110110010111101001111110011111101011110011010001100010010100101111011001011110100111111001111110101111001100110010011100111110111000100101001011110110010111101001111110011111101011110011010001100010010100101111011001011110100111111001111110101111001100110010011100111101101011110 c4a5ecbd3f3f5e68c4a5ecbd3f3f5e664e7dc4a5ecbd3f3f5e68c4a5ecbd3f3f5e664e7b5e
UTF-8 張貊렎렠^h張貊렎렠^fN}張貊렎렠^h張貊렎렠^fN{^ 11100101101111001011010111101000101100101000101011101011101000001000111011101011101000001010000001011110011010001110010110111100101101011110100010110010100010101110101110100000100011101110101110100000101000000101111001100110010011100111110111100101101111001011010111101000101100101000101011101011101000001000111011101011101000001010000001011110011010001110010110111100101101011110100010110010100010101110101110100000100011101110101110100000101000000101111001100110010011100111101101011110 e5bcb5e8b28aeba08eeba0a05e68e5bcb5e8b28aeba08eeba0a05e664e7de5bcb5e8b28aeba08eeba0a05e68e5bcb5e8b28aeba08eeba0a05e664e7b5e
UHC 張貊렎렠^h張貊렎렠^fN}張貊렎렠^h張貊렎렠^fN{^ 111011011110010111011000111001111000111010100100100011101011000101011110011010001110110111100101110110001110011110001110101001001000111010110001010111100110011001001110011111011110110111100101110110001110011110001110101001001000111010110001010111100110100011101101111001011101100011100111100011101010010010001110101100010101111001100110010011100111101101011110 ede5d8e78ea48eb15e68ede5d8e78ea48eb15e664e7dede5d8e78ea48eb15e68ede5d8e78ea48eb15e664e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
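
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The codec names (`cp932` for SJIS-Win, `cp949` for UHC), the helper names `encode_bits` and `conversion_table`, and the choice to replace unencodable characters with "?" (byte 3f) are assumptions made for illustration; the tool's replacement behavior may differ in detail.

```python
# Minimal sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: the labels below map to these Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("張^h").items():
        print(label, binary, hexadecimal)
```

For example, `encode_bits("張", "utf-8")` yields the hexadecimal string `e5bcb5`, which matches the first three bytes of the UTF-8 row above.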