To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 肢私????鏃?鏃?詞????忿基?蛟??^ 100011101000100010001110100001000011111100111111001111110011111111101000010101100011111111101000010101100011111110001110100011000011111100111111001111110011111110011100011111001000101011101110001111111110010110000000001111110011111101011110 8e888e843f3f3f3fe8563fe8563f8e8c3f3f3f3f9c7c8aee3fe5803f3f5e
EUC-JP 肢私????鏃?鏃?詞????忿基?蛟??^ 101110111110100010111011111001000011111100111111001111110011111111101111101101110011111111101111101101110011111110111011111011000011111100111111001111110011111111010111110111011011010011110000001111111110100111100000001111110011111101011110 bbe8bbe43f3f3f3fefb73fefb73fbbec3f3f3f3fd7ddb4f03fe9e03f3f5e
UTF-8 肢私렎렠뤯훵鏃앓鏃씜詞뜅咽쨴콓忿基렱蛟렱쌤^ 11101000100000101010001011100111101001111000000111101011101000001000111011101011101000001010000011101011101001001010111111101101100110111011010111101001100011111000001111101100100101011001001111101001100011111000001111101100100101001001110011101000101010011001111011101011100111001000010111101111101001101001111011101100101010001011010011101100101111011001001111100101101111111011111111100101100111111011101011101011101000001011000111101000100110111001111111101011101000001011000111101100100011001010010001011110 e882a2e7a781eba08eeba0a0eba4afed9bb5e98f83ec9593e98f83ec949ce8a99eeb9c85efa69eeca8b4ecbd93e5bfbfe59fbaeba0b1e89b9feba0b1ec8ca45e
UHC 肢私렎렠뤯훵鏃앓鏃씜詞뜅咽쨴콓忿基렱蛟렱쌤^ 11110010101101101101111011100111100011101010010010001110101100011000111111011101110010001101000011110000111011001011111011001110111100001110110010111110101111011101111011110010101101101101111011100110111011001010010010001110101100011000111111011101110010001101000011110001100011101011111011001110111100011000111010111110101111011101110001011110 f2b6dee78ea48eb18fddc8d0f0ecbecef0ecbebddef2b6dee6eca48eb18fddc8d0f18ebecef18ebebddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)