To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 淅壓夭W^淅壓夭\}v淅壓夭W^淅壓夭\}vB 1001111111000110100110101101100010011010111011100101011101011110100111111100011010011010110110001001101011101110010111000111110101110110100111111100011010011010110110001001101011101110010101110101111010011111110001101001101011011000100110101110111001011100011111010111011001000010 9fc69ad89aee575e9fc69ad89aee5c7d769fc69ad89aee575e9fc69ad89aee5c7d7642
EUC-JP 淅壓夭W^淅壓夭\}v淅壓夭W^淅壓夭\}vB 1101111011001000110101001101101011010100111100000101011101011110110111101100100011010100110110101101010011110000010111000111110101110110110111101100100011010100110110101101010011110000010101110101111011011110110010001101010011011010110101001111000001011100011111010111011001000010 dec8d4dad4f0575edec8d4dad4f05c7d76dec8d4dad4f0575edec8d4dad4f05c7d7642
UTF-8 淅壓夭W^淅壓夭\}v淅壓夭W^淅壓夭\}vB 1110011010110111100001011110010110100011100100111110010110100100101011010101011101011110111001101011011110000101111001011010001110010011111001011010010010101101010111000111110101110110111001101011011110000101111001011010001110010011111001011010010010101101010101110101111011100110101101111000010111100101101000111001001111100101101001001010110101011100011111010111011001000010 e6b785e5a393e5a4ad575ee6b785e5a393e5a4ad5c7d76e6b785e5a393e5a4ad575ee6b785e5a393e5a4ad5c7d7642
UHC 淅壓夭W^淅壓夭\}v淅壓夭W^淅壓夭\}vB 1110000010110010111001001110001011101000111011000101011101011110111000001011001011100100111000101110100011101100010111000111110101110110111000001011001011100100111000101110100011101100010101110101111011100000101100101110010011100010111010001110110001011100011111010111011001000010 e0b2e4e2e8ec575ee0b2e4e2e8ec5c7d76e0b2e4e2e8ec575ee0b2e4e2e8ec5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)