Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????A???????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010000010011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f413f3f3f3f3f3f3f3f
SJIS-WIN テァツ債サテァツ債湘ァツ個セAティツ堕ケテァツ 11000011101001111100001010001101110000101011101111000011101001111100001010001101110000101000111111000011101001111100001010001100110000101011111001000001110000111010100011000010100100011100001010111001110000111010011111000010 c3a7c28dc2bbc3a7c28dc28fc3a7c28cc2be41c3a8c291c2b9c3a7c2
EUC-JP テァツ債サテァツ債湘ァツ個セAティツ堕ケテァツ 100011101100001110001110101001111000111011000010101110101100010010001110101110111000111011000011100011101010011110001110110000101011101011000100101111101100010110001110101001111000111011000010101110001100010010001110101111100100000110001110110000111000111010101000100011101100001011000010110001001000111010111001100011101100001110001110101001111000111011000010 8ec38ea78ec2bac48ebb8ec38ea78ec2bac4bec58ea78ec2b8c48ebe418ec38ea88ec2c2c48eb98ec38ea78ec2
UTF-8 テァツ債サテァツ債湘ァツ個セAティツ堕ケテァツ 11101111101111101000001111101111101111011010011111101111101111101000001011100101100000101011010111101111101111011011101111101111101111101000001111101111101111011010011111101111101111101000001011100101100000101011010111100110101110011001100011101111101111011010011111101111101111101000001011100101100000001000101111101111101111011011111001000001111011111011111010000011111011111011110110101000111011111011111010000010111001011010000010010101111011111011110110111001111011111011111010000011111011111011110110100111111011111011111010000010 efbe83efbda7efbe82e582b5efbdbbefbe83efbda7efbe82e582b5e6b998efbda7efbe82e5808befbdbe41efbe83efbda8efbe82e5a095efbdb9efbe83efbda7efbe82
UHC ???債????債湘??個?A???????? 001111110011111100111111111100111111000000111111001111110011111100111111111100111111000011011111110011110011111100111111110010111100000100111111010000010011111100111111001111110011111100111111001111110011111100111111 3f3f3ff3f03f3f3f3ff3f0dfcf3f3fcbc13f413f3f3f3f3f3f3f3f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
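
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: it assumes that Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the ISO-8859-1, SJIS-Win, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `to_bit_strings`, and `conversion_table` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def conversion_table(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\uff83" is the halfwidth katakana TE, followed by an ASCII "A".
    for label, (binary, hexadecimal) in conversion_table("\uff83A").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter "A", every charset in this sketch yields the single byte 0x41 (binary 01000001), which matches the lone `41` visible in each row of the table.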