To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????r}v????????r}vB 0011111100111111001111110011111100111111001111110011111100111111011100100111110101110110001111110011111100111111001111110011111100111111001111110011111101110010011111010111011001000010 3f3f3f3f3f3f3f3f727d763f3f3f3f3f3f3f3f727d7642
SJIS-WIN ツ妥セツ堕アツ但r}vツ妥セツ堕アツ但r}vB 1100001010010001110000111011111011000010100100011100001010110001110000101001001001000001011100100111110101110110110000101001000111000011101111101100001010010001110000101011000111000010100100100100000101110010011111010111011001000010 c291c3bec291c2b1c29241727d76c291c3bec291c2b1c29241727d7642
EUC-JP ツ妥セツ堕アツ但r}vツ妥セツ堕アツ但r}vB 100011101100001011000010110001011000111010111110100011101100001011000010110001001000111010110001100011101100001011000011101000100111001001111101011101101000111011000010110000101100010110001110101111101000111011000010110000101100010010001110101100011000111011000010110000111010001001110010011111010111011001000010 8ec2c2c58ebe8ec2c2c48eb18ec2c3a2727d768ec2c2c58ebe8ec2c2c48eb18ec2c3a2727d7642
UTF-8 ツ妥セツ堕アツ但r}vツ妥セツ堕アツ但r}vB 11101111101111101000001011100101101001101010010111101111101111011011111011101111101111101000001011100101101000001001010111101111101111011011000111101111101111101000001011100100101111011000011001110010011111010111011011101111101111101000001011100101101001101010010111101111101111011011111011101111101111101000001011100101101000001001010111101111101111011011000111101111101111101000001011100100101111011000011001110010011111010111011001000010 efbe82e5a6a5efbdbeefbe82e5a095efbdb1efbe82e4bd86727d76efbe82e5a6a5efbdbeefbe82e5a095efbdb1efbe82e4bd86727d7642
UHC ?妥?????但r}v?妥?????但r}vB 001111111111011011100110001111110011111100111111001111110011111111010011101000110111001001111101011101100011111111110110111001100011111100111111001111110011111100111111110100111010001101110010011111010111011001000010 3ff6e63f3f3f3f3fd3a3727d763ff6e63f3f3f3f3fd3a3727d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)