To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????v??????????vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN ツ堕コツ堕スツ短ツ端vツ堕コツ堕スツ短ツ端vB 11000010100100011100001010111010110000101001000111000010101111011100001010010010010110101100001010010010010110110111011011000010100100011100001010111010110000101001000111000010101111011100001010010010010110101100001010010010010110110111011001000010 c291c2bac291c2bdc2925ac2925b76c291c2bac291c2bdc2925ac2925b7642
EUC-JP ツ堕コツ堕スツ短ツ端vツ堕コツ堕スツ短ツ端vB 10001110110000101100001011000100100011101011101010001110110000101100001011000100100011101011110110001110110000101100001110111011100011101100001011000011101111000111011010001110110000101100001011000100100011101011101010001110110000101100001011000100100011101011110110001110110000101100001110111011100011101100001011000011101111000111011001000010 8ec2c2c48eba8ec2c2c48ebd8ec2c3bb8ec2c3bc768ec2c2c48eba8ec2c2c48ebd8ec2c3bb8ec2c3bc7642
UTF-8 ツ堕コツ堕スツ短ツ端vツ堕コツ堕スツ短ツ端vB 111011111011111010000010111001011010000010010101111011111011110110111010111011111011111010000010111001011010000010010101111011111011110110111101111011111011111010000010111001111001111110101101111011111011111010000010111001111010101110101111011101101110111110111110100000101110010110100000100101011110111110111101101110101110111110111110100000101110010110100000100101011110111110111101101111011110111110111110100000101110011110011111101011011110111110111110100000101110011110101011101011110111011001000010 efbe82e5a095efbdbaefbe82e5a095efbdbdefbe82e79fadefbe82e7abaf76efbe82e5a095efbdbaefbe82e5a095efbdbdefbe82e79fadefbe82e7abaf7642
UHC ???????短?端v???????短?端vB 001111110011111100111111001111110011111100111111001111111101001110101101001111111101001110101110011101100011111100111111001111110011111100111111001111110011111111010011101011010011111111010011101011100111011001000010 3f3f3f3f3f3f3fd3ad3fd3ae763f3f3f3f3f3f3fd3ad3fd3ae7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)