To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 渼爾渼フ魘、湜ユシ渼爾渼フ魘、湜ユシ^ 1111101101001001100011101010001011111011010010011100110011101001101101001010010011111011010001111101010110111100111110110100100110001110101000101111101101001001110011001110100110110100101001001111101101000111110101011011110001011110 fb498ea2fb49cce9b4a4fb47d5bcfb498ea2fb49cce9b4a4fb47d5bc5e
EUC-JP 渼爾渼フ魘、湜ユシ渼爾渼フ魘、湜ユシ^ 10001111110001111111000010111100101001001000111111000111111100001000111011001100111100101011011010001110101001001000111111000111111111001000111011010101100011101011110010001111110001111111000010111100101001001000111111000111111100001000111011001100111100101011011010001110101001001000111111000111111111001000111011010101100011101011110001011110 8fc7f0bca48fc7f08eccf2b68ea48fc7fc8ed58ebc8fc7f0bca48fc7f08eccf2b68ea48fc7fc8ed58ebc5e
UTF-8 渼爾渼フ魘、湜ユシ渼爾渼フ魘、湜ユシ^ 11100110101110001011110011100111100010001011111011100110101110001011110011101111101111101000110011101001101011011001100011101111101111011010010011100110101110011001110011101111101111101001010111101111101111011011110011100110101110001011110011100111100010001011111011100110101110001011110011101111101111101000110011101001101011011001100011101111101111011010010011100110101110011001110011101111101111101001010111101111101111011011110001011110 e6b8bce788bee6b8bcefbe8ce9ad98efbda4e6b99cefbe95efbdbce6b8bce788bee6b8bcefbe8ce9ad98efbda4e6b99cefbe95efbdbc5e
UHC 渼爾渼???湜??渼爾渼???湜??^ 110110101011010011101100101100111101101010110100001111110011111100111111111000111101011100111111001111111101101010110100111011001011001111011010101101000011111100111111001111111110001111010111001111110011111101011110 dab4ecb3dab43f3f3fe3d73f3fdab4ecb3dab43f3f3fe3d73f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)