To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????W^????\}v????W^????\}vB 001111110011111100111111001111110101011101011110001111110011111100111111001111110101110001111101011101100011111100111111001111110011111101010111010111100011111100111111001111110011111101011100011111010111011001000010 3f3f3f3f575e3f3f3f3f5c7d763f3f3f3f575e3f3f3f3f5c7d7642
SJIS-WIN 上リシスW^上リシス\}v上リシスW^上リシス\}vB 10001111111000111101100010111100101111010101011101011110100011111110001111011000101111001011110101011100011111010111011010001111111000111101100010111100101111010101011101011110100011111110001111011000101111001011110101011100011111010111011001000010 8fe3d8bcbd575e8fe3d8bcbd5c7d768fe3d8bcbd575e8fe3d8bcbd5c7d7642
EUC-JP 上リシスW^上リシス\}v上リシスW^上リシス\}vB 10111110111001011000111011011000100011101011110010001110101111010101011101011110101111101110010110001110110110001000111010111100100011101011110101011100011111010111011010111110111001011000111011011000100011101011110010001110101111010101011101011110101111101110010110001110110110001000111010111100100011101011110101011100011111010111011001000010 bee58ed88ebc8ebd575ebee58ed88ebc8ebd5c7d76bee58ed88ebc8ebd575ebee58ed88ebc8ebd5c7d7642
UTF-8 上リシスW^上リシス\}v上リシスW^上リシス\}vB 1110010010111000100010101110111110111110100110001110111110111101101111001110111110111101101111010101011101011110111001001011100010001010111011111011111010011000111011111011110110111100111011111011110110111101010111000111110101110110111001001011100010001010111011111011111010011000111011111011110110111100111011111011110110111101010101110101111011100100101110001000101011101111101111101001100011101111101111011011110011101111101111011011110101011100011111010111011001000010 e4b88aefbe98efbdbcefbdbd575ee4b88aefbe98efbdbcefbdbd5c7d76e4b88aefbe98efbdbcefbdbd575ee4b88aefbe98efbdbcefbdbd5c7d7642
UHC 上???W^上???\}v上???W^上???\}vB 11011111101111100011111100111111001111110101011101011110110111111011111000111111001111110011111101011100011111010111011011011111101111100011111100111111001111110101011101011110110111111011111000111111001111110011111101011100011111010111011001000010 dfbe3f3f3f575edfbe3f3f3f5c7d76dfbe3f3f3f575edfbe3f3f3f5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)