To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??G?????W??????G?????W????E 001111110011111101000111001111110011111100111111001111110011111101010111001111110011111100111111001111110011111100111111010001110011111100111111001111110011111100111111010101110011111100111111001111110011111101000101 3f3f473f3f3f3f3f573f3f3f3f3f3f473f3f3f3f3f573f3f3f3f45
SJIS-WIN テサGツ篠シテサWツ。テ杠テサGツ篠シテサWツ。テ枌E 11000011101110110100011111000010100011101100001010111100110000111011101101010111110000101010000111000011100111100101100111000011101110110100011111000010100011101100001010111100110000111011101101010111110000101010000111000011100111100110001001000101 c3bb47c28ec2bcc3bb57c2a1c39e59c3bb47c28ec2bcc3bb57c2a1c39e6245
EUC-JP テサGツ篠シテサWツ。テ杠テサGツ篠シテサWツ。テ枌E 10001110110000111000111010111011010001111000111011000010101111001100010010001110101111001000111011000011100011101011101101010111100011101100001010001110101000011000111011000011110110111011101010001110110000111000111010111011010001111000111011000010101111001100010010001110101111001000111011000011100011101011101101010111100011101100001010001110101000011000111011000011110110111100001101000101 8ec38ebb478ec2bcc48ebc8ec38ebb578ec28ea18ec3dbba8ec38ebb478ec2bcc48ebc8ec38ebb578ec28ea18ec3dbc345
UTF-8 テサGツ篠シテサWツ。テ杠テサGツ篠シテサWツ。テ枌E 1110111110111110100000111110111110111101101110110100011111101111101111101000001011100111101011111010000011101111101111011011110011101111101111101000001111101111101111011011101101010111111011111011111010000010111011111011110110100001111011111011111010000011111001101001110110100000111011111011111010000011111011111011110110111011010001111110111110111110100000101110011110101111101000001110111110111101101111001110111110111110100000111110111110111101101110110101011111101111101111101000001011101111101111011010000111101111101111101000001111100110100111101000110001000101 efbe83efbdbb47efbe82e7afa0efbdbcefbe83efbdbb57efbe82efbda1efbe83e69da0efbe83efbdbb47efbe82e7afa0efbdbcefbe83efbdbb57efbe82efbda1efbe83e69e8c45
UHC ??G?篠???W??????G?篠???W????E 0011111100111111010001110011111111100001110001100011111100111111001111110101011100111111001111110011111100111111001111110011111101000111001111111110000111000110001111110011111100111111010101110011111100111111001111110011111101000101 3f3f473fe1c63f3f3f573f3f3f3f3f3f473fe1c63f3f3f573f3f3f3f45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)