To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????h????????????k 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101011 3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f3f6b
SJIS-WIN テ・ツカツエティツョツ擢hテ・ツカツエティツョツ擢k 11000011101001011100001010110110110000101011010011000011101010001100001010101110110000101001001101000110011010001100001110100101110000101011011011000010101101001100001110101000110000101010111011000010100100110100011001101011 c3a5c2b6c2b4c3a8c2aec2934668c3a5c2b6c2b4c3a8c2aec293466b
EUC-JP テ・ツカツエティツョツ擢hテ・ツカツエティツョツ擢k 1000111011000011100011101010010110001110110000101000111010110110100011101100001010001110101101001000111011000011100011101010100010001110110000101000111010101110100011101100001011000101101001110110100010001110110000111000111010100101100011101100001010001110101101101000111011000010100011101011010010001110110000111000111010101000100011101100001010001110101011101000111011000010110001011010011101101011 8ec38ea58ec28eb68ec28eb48ec38ea88ec28eae8ec2c5a7688ec38ea58ec28eb68ec28eb48ec38ea88ec28eae8ec2c5a76b
UTF-8 テ・ツカツエティツョツ擢hテ・ツカツエティツョツ擢k 1110111110111110100000111110111110111101101001011110111110111110100000101110111110111101101101101110111110111110100000101110111110111101101101001110111110111110100000111110111110111101101010001110111110111110100000101110111110111101101011101110111110111110100000101110011010010011101000100110100011101111101111101000001111101111101111011010010111101111101111101000001011101111101111011011011011101111101111101000001011101111101111011011010011101111101111101000001111101111101111011010100011101111101111101000001011101111101111011010111011101111101111101000001011100110100100111010001001101011 efbe83efbda5efbe82efbdb6efbe82efbdb4efbe83efbda8efbe82efbdaeefbe82e693a268efbe83efbda5efbe82efbdb6efbe82efbdb4efbe83efbda8efbe82efbdaeefbe82e693a26b
UHC ???????????擢h???????????擢k 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111111011011110111011010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111111101101111011101101011 3f3f3f3f3f3f3f3f3f3f3ff6f7683f3f3f3f3f3f3f3f3f3f3ff6f76b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)