Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 偲萓痔W^偲萓痔\}v偲萓痔W^偲萓痔\}vB 1000111011000011111001001011111010001110101001000101011101011110100011101100001111100100101111101000111010100100010111000111110101110110100011101100001111100100101111101000111010100100010101110101111010001110110000111110010010111110100011101010010001011100011111010111011001000010 8ec3e4be8ea4575e8ec3e4be8ea45c7d768ec3e4be8ea4575e8ec3e4be8ea45c7d7642
EUC-JP 偲萓痔W^偲萓痔\}v偲萓痔W^偲萓痔\}vB 1011110011000101111010001100000010111100101001100101011101011110101111001100010111101000110000001011110010100110010111000111110101110110101111001100010111101000110000001011110010100110010101110101111010111100110001011110100011000000101111001010011001011100011111010111011001000010 bcc5e8c0bca6575ebcc5e8c0bca65c7d76bcc5e8c0bca6575ebcc5e8c0bca65c7d7642
UTF-8 偲萓痔W^偲萓痔\}v偲萓痔W^偲萓痔\}vB 1110010110000001101100101110100010010000100100111110011110010111100101000101011101011110111001011000000110110010111010001001000010010011111001111001011110010100010111000111110101110110111001011000000110110010111010001001000010010011111001111001011110010100010101110101111011100101100000011011001011101000100100001001001111100111100101111001010001011100011111010111011001000010 e581b2e89093e79794575ee581b2e89093e797945c7d76e581b2e89093e79794575ee581b2e89093e797945c7d7642
UHC ??痔W^??痔\}v??痔W^??痔\}vB 001111110011111111110110110000000101011101011110001111110011111111110110110000000101110001111101011101100011111100111111111101101100000001010111010111100011111100111111111101101100000001011100011111010111011001000010 3f3ff6c0575e3f3ff6c05c7d763f3ff6c0575e3f3ff6c05c7d7642

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
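
The rows in the table above can be reproduced offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The helper name `encode_row` and the `CHARSETS` mapping are made up for illustration, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to correspond to the charset labels used here. Characters a charset cannot represent are replaced with "?" (3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "偲B"  # one kanji from the table plus an ASCII letter
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

ASCII characters such as "B" come out as the same single byte (42) in every charset listed, while the kanji take two bytes in SJIS-WIN and EUC-JP and three bytes in UTF-8.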