To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 祚狢狢W^祚狢狢\}v祚狢狢W^祚狢狢\}vB 1110001001001110111000001100000011100000110000000101011101011110111000100100111011100000110000001110000011000000010111000111110101110110111000100100111011100000110000001110000011000000010101110101111011100010010011101110000011000000111000001100000001011100011111010111011001000010 e24ee0c0e0c0575ee24ee0c0e0c05c7d76e24ee0c0e0c0575ee24ee0c0e0c05c7d7642
EUC-JP 祚狢狢W^祚狢狢\}v祚狢狢W^祚狢狢\}vB 1110001110101111111000001100001011100000110000100101011101011110111000111010111111100000110000101110000011000010010111000111110101110110111000111010111111100000110000101110000011000010010101110101111011100011101011111110000011000010111000001100001001011100011111010111011001000010 e3afe0c2e0c2575ee3afe0c2e0c25c7d76e3afe0c2e0c2575ee3afe0c2e0c25c7d7642
UTF-8 祚狢狢W^祚狢狢\}v祚狢狢W^祚狢狢\}vB 1110011110100101100110101110011110001011101000101110011110001011101000100101011101011110111001111010010110011010111001111000101110100010111001111000101110100010010111000111110101110110111001111010010110011010111001111000101110100010111001111000101110100010010101110101111011100111101001011001101011100111100010111010001011100111100010111010001001011100011111010111011001000010 e7a59ae78ba2e78ba2575ee7a59ae78ba2e78ba25c7d76e7a59ae78ba2e78ba2575ee7a59ae78ba2e78ba25c7d7642
UHC 祚??W^祚??\}v祚??W^祚??\}vB 111100001101010000111111001111110101011101011110111100001101010000111111001111110101110001111101011101101111000011010100001111110011111101010111010111101111000011010100001111110011111101011100011111010111011001000010 f0d43f3f575ef0d43f3f5c7d76f0d43f3f575ef0d43f3f5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)