To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???B?????????B??????? 001111110011111100111111010000100011111100111111001111110011111100111111001111110011111100111111001111110100001000111111001111110011111100111111001111110011111100111111 3f3f3f423f3f3f3f3f3f3f3f3f423f3f3f3f3f3f3f
SJIS-WIN テ猟「Bツ渉サテウテ氾猟「Bツ渉サテウテ韮 110000111001011111000010101000100100001011000010100011111100001010111011110000111011001111000011100101001100001110010111110000101010001001000010110000101000111111000010101110111100001110110011110000111001010001000010 c397c2a242c28fc2bbc3b3c394c397c2a242c28fc2bbc3b3c39442
EUC-JP テ猟「Bツ渉サテウテ氾猟「Bツ渉サテウテ韮 10001110110000111100111011000100100011101010001001000010100011101100001010111110110001001000111010111011100011101100001110001110101100111000111011000011110010001100010111001110110001001000111010100010010000101000111011000010101111101100010010001110101110111000111011000011100011101011001110001110110000111100011110100011 8ec3cec48ea2428ec2bec48ebb8ec38eb38ec3c8c5cec48ea2428ec2bec48ebb8ec38eb38ec3c7a3
UTF-8 テ猟「Bツ渉サテウテ氾猟「Bツ渉サテウテ韮 1110111110111110100000111110011110001100100111111110111110111101101000100100001011101111101111101000001011100110101110001000100111101111101111011011101111101111101111101000001111101111101111011011001111101111101111101000001111100110101100001011111011100111100011001001111111101111101111011010001001000010111011111011111010000010111001101011100010001001111011111011110110111011111011111011111010000011111011111011110110110011111011111011111010000011111010011001111110101110 efbe83e78c9fefbda242efbe82e6b889efbdbbefbe83efbdb3efbe83e6b0bee78c9fefbda242efbe82e6b889efbdbbefbe83efbdb3efbe83e99fae
UHC ???B??????氾??B??????? 00111111001111110011111101000010001111110011111100111111001111110011111100111111110110111111000000111111001111110100001000111111001111110011111100111111001111110011111100111111 3f3f3f423f3f3f3f3f3fdbf03f3f423f3f3f3f3f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
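
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool's converter for a few characters. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_table(text):
    """Return {charset label: (binary bit string, hexadecimal string)}."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" emits "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("B").items():
        print(label, bits, hexa)
```

For the single letter "B", every charset in this sketch yields the same byte, 01000010 (hexadecimal 42), which matches the 42 bytes visible in each row of the table.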