To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 踵門彫W^踵門彫\}v踵門彫W^踵門彫\}vB 1110011011111001100101101110010110010010101001000101011101011110111001101111100110010110111001011001001010100100010111000111110101110110111001101111100110010110111001011001001010100100010101110101111011100110111110011001011011100101100100101010010001011100011111010111011001000010 e6f996e592a4575ee6f996e592a45c7d76e6f996e592a4575ee6f996e592a45c7d7642
EUC-JP 踵門彫W^踵門彫\}v踵門彫W^踵門彫\}vB 1110110011111011110011001110011111000100101001100101011101011110111011001111101111001100111001111100010010100110010111000111110101110110111011001111101111001100111001111100010010100110010101110101111011101100111110111100110011100111110001001010011001011100011111010111011001000010 ecfbcce7c4a6575eecfbcce7c4a65c7d76ecfbcce7c4a6575eecfbcce7c4a65c7d7642
UTF-8 踵門彫W^踵門彫\}v踵門彫W^踵門彫\}vB 1110100010111000101101011110100110010110100000001110010110111101101010110101011101011110111010001011100010110101111010011001011010000000111001011011110110101011010111000111110101110110111010001011100010110101111010011001011010000000111001011011110110101011010101110101111011101000101110001011010111101001100101101000000011100101101111011010101101011100011111010111011001000010 e8b8b5e99680e5bdab575ee8b8b5e99680e5bdab5c7d76e8b8b5e99680e5bdab575ee8b8b5e99680e5bdab5c7d7642
UHC 踵門彫W^踵門彫\}v踵門彫W^踵門彫\}vB 1111000110100010110110101010011011110000110000010101011101011110111100011010001011011010101001101111000011000001010111000111110101110110111100011010001011011010101001101111000011000001010101110101111011110001101000101101101010100110111100001100000101011100011111010111011001000010 f1a2daa6f0c1575ef1a2daa6f0c15c7d76f1a2daa6f0c1575ef1a2daa6f0c15c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)