To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset   Character   Bit string (binary)   Bit string (hexadecimal)
ISO-8859-1 ??????????U??????????U\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101010101001111110011111100111111001111110011111100111111001111110011111100111111001111110101010101011100 3f3f3f3f3f3f3f3f3f3f553f3f3f3f3f3f3f3f3f3f555c
SJIS-WIN テカツ・テケテイテコUテカツ・テケテイテコU\ 1100001110110110110000101010010111000011101110011100001110110010110000111011101001010101110000111011011011000010101001011100001110111001110000111011001011000011101110100101010101011100 c3b6c2a5c3b9c3b2c3ba55c3b6c2a5c3b9c3b2c3ba555c
EUC-JP テカツ・テケテイテコUテカツ・テケテイテコU\ 10001110110000111000111010110110100011101100001010001110101001011000111011000011100011101011100110001110110000111000111010110010100011101100001110001110101110100101010110001110110000111000111010110110100011101100001010001110101001011000111011000011100011101011100110001110110000111000111010110010100011101100001110001110101110100101010101011100 8ec38eb68ec28ea58ec38eb98ec38eb28ec38eba558ec38eb68ec28ea58ec38eb98ec38eb28ec38eba555c
UTF-8 テカツ・テケテイテコUテカツ・テケテイテコU\ 111011111011111010000011111011111011110110110110111011111011111010000010111011111011110110100101111011111011111010000011111011111011110110111001111011111011111010000011111011111011110110110010111011111011111010000011111011111011110110111010010101011110111110111110100000111110111110111101101101101110111110111110100000101110111110111101101001011110111110111110100000111110111110111101101110011110111110111110100000111110111110111101101100101110111110111110100000111110111110111101101110100101010101011100 efbe83efbdb6efbe82efbda5efbe83efbdb9efbe83efbdb2efbe83efbdba55efbe83efbdb6efbe82efbda5efbe83efbdb9efbe83efbdb2efbe83efbdba555c
UHC ??????????U??????????U\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101010101001111110011111100111111001111110011111100111111001111110011111100111111001111110101010101011100 3f3f3f3f3f3f3f3f3f3f553f3f3f3f3f3f3f3f3f3f555c

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
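
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the implementation behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and `to_bitstrings` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 and UHC rows above.

```python
# Table labels mapped to Python codec names (assumed equivalents).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    # Two half-width katakana characters followed by "U" and a backslash.
    sample = "\uff83\uff76U\\"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bitstrings(sample, codec)
        print(label, binary, hexadecimal)
```

Other converters may use slightly different mapping tables for the same charset label, so results for edge-case characters can differ from the table above.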