What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN テ「テヲツ湘ィツ沓テ」テョツ仰仰債」テヲ 11000011101000101100001110100110110000101000111111000011101010001100001010001100010000101100001110100011110000111010111011000010100010111100001010001011110000101000110111000010101000111100001110100110 c3a2c3a6c28fc3a8c28c42c3a3c3aec28bc28bc28dc2a3c3a6
EUC-JP テ「テヲツ湘ィツ沓テ」テョツ仰仰債」テヲ 10001110110000111000111010100010100011101100001110001110101001101000111011000010101111101100010110001110101010001000111011000010101101111010001110001110110000111000111010100011100011101100001110001110101011101000111011000010101101101100010010110110110001001011101011000100100011101010001110001110110000111000111010100110 8ec38ea28ec38ea68ec2bec58ea88ec2b7a38ec38ea38ec38eae8ec2b6c4b6c4bac48ea38ec38ea6
UTF-8 テ「テヲツ湘ィツ沓テ」テョツ仰仰債」テヲ 111011111011111010000011111011111011110110100010111011111011111010000011111011111011110110100110111011111011111010000010111001101011100110011000111011111011110110101000111011111011111010000010111001101011001010010011111011111011111010000011111011111011110110100011111011111011111010000011111011111011110110101110111011111011111010000010111001001011101110110000111001001011101110110000111001011000001010110101111011111011110110100011111011111011111010000011111011111011110110100110 efbe83efbda2efbe83efbda6efbe82e6b998efbda8efbe82e6b293efbe83efbda3efbe83efbdaeefbe82e4bbb0e4bbb0e582b5efbda3efbe83efbda6
UHC ?????湘??沓?????仰仰債??? 00111111001111110011111100111111001111111101111111001111001111110011111111010011110010110011111100111111001111110011111100111111111001001110011011100100111001101111001111110000001111110011111100111111 3f3f3f3f3fdfcf3f3fd3cb3f3f3f3f3fe4e6e4e6f3f03f3f3f

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
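
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation (which is not shown). It assumes that Python's standard codecs `cp932` and `cp949` correspond to SJIS-Win and UHC, and that a character a charset cannot represent is replaced with "?" (byte 3f), as the ISO-8859-1 row above suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Sketch: encode a string in several charsets and show the bytes as
# binary and hexadecimal strings. Codec names are Python's standard ones;
# the mapping to the table's charset labels is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.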