To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 上眈ャ捨濵竺杓痔}v上眈ャ捨濵竺杓痔}vB 100011111110001111100001101111001010110011110000101111111000111011001100111110110100110110001110101100011000111011011011100011101010010001111101011101101000111111100011111000011011110010101100111100001011111110001110110011001111101101001101100011101011000110001110110110111000111010100100011111010111011001000010 8fe3e1bcacf0bf8eccfb4d8eb18edb8ea47d768fe3e1bcacf0bf8eccfb4d8eb18edb8ea47d7642
EUC-JP 上眈ャ?捨濵竺杓痔}v上眈ャ?捨濵竺杓痔}vB 1011111011100101111000101011111010001110101011000011111110111100110011101000111111001001101001101011110010110011101111001101110110111100101001100111110101110110101111101110010111100010101111101000111010101100001111111011110011001110100011111100100110100110101111001011001110111100110111011011110010100110011111010111011001000010 bee5e2be8eac3fbcce8fc9a6bcb3bcddbca67d76bee5e2be8eac3fbcce8fc9a6bcb3bcddbca67d7642
UTF-8 上眈ャ捨濵竺杓痔}v上眈ャ捨濵竺杓痔}vB 1110010010111000100010101110011110011100100010001110111110111101101011001110111010000001101111101110011010001101101010001110011010111111101101011110011110101011101110101110011010011101100100111110011110010111100101000111110101110110111001001011100010001010111001111001110010001000111011111011110110101100111011101000000110111110111001101000110110101000111001101011111110110101111001111010101110111010111001101001110110010011111001111001011110010100011111010111011001000010 e4b88ae79c88efbdacee81bee68da8e6bfb5e7abbae69d93e797947d76e4b88ae79c88efbdacee81bee68da8e6bfb5e7abbae69d93e797947d7642
UHC 上眈??捨?竺杓痔}v上眈??捨?竺杓痔}vB 1101111110111110111101111010111100111111001111111101111011010111001111111111010111100111111110001111010111110110110000000111110101110110110111111011111011110111101011110011111100111111110111101101011100111111111101011110011111111000111101011111011011000000011111010111011001000010 dfbef7af3f3fded73ff5e7f8f5f6c07d76dfbef7af3f3fded73ff5e7f8f5f6c07d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)