Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 褻サ譏、賰呻セ橲譏サ褻、賰呻セ樶福^ 11100101111101101011101111100110100110001010010011111011101011011001100111101111101111101001111011101101111100111010000111100110100110001011101111100101111101101010010011111011101011011001100111101111101111101001111011101110100101011001111101011110 e5f6bbe698a4fbad99efbe9eedf3a1e698bbe5f6a4fbad99efbe9eee959f5e
EUC-JP 褻サ譏、賰呻セ橲?譏サ褻、賰呻セ樶福^ 1110101011111000100011101011101111101011111110001000111010100100100011111101111110111001110100101111000110001110101111101101110011101111001111111110101111111000100011101011101111101010111110001000111010100100100011111101111110111001110100101111000110001110101111101101110011110000110010101010000101011110 eaf88ebbebf88ea48fdfb9d2f18ebedcef3febf88ebbeaf88ea48fdfb9d2f18ebedcf0caa15e
UTF-8 褻サ譏、賰呻セ橲譏サ褻、賰呻セ樶福^ 11101000101001001011101111101111101111011011101111101000101011011000111111101111101111011010010011101000101100111011000011100101100100011011101111101111101111011011111011100110101010011011001011101110100010101001010011101000101011011000111111101111101111011011101111101000101001001011101111101111101111011010010011101000101100111011000011100101100100011011101111101111101111011011111011100110101010001011011011100111101001101000111101011110 e8a4bbefbdbbe8ad8fefbda4e8b3b0e591bbefbdbee6a9b2ee8a94e8ad8fefbdbbe8a4bbefbda4e8b3b0e591bbefbdbee6a8b6e7a68f5e
UHC 褻?譏??呻???譏?褻??呻??福^ 1110000011100001001111111101000111000001001111110011111111100011111000100011111100111111001111111101000111000001001111111110000011100001001111110011111111100011111000100011111100111111110111001101100001011110 e0e13fd1c13f3fe3e23f3f3fd1c13fe0e13f3fe3e23f3fdcd85e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
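
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's own implementation: the `CODECS` mapping and the `encode_table` helper are hypothetical names, and the choice of Python codecs (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from the tool's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be how the table above treats them.

```python
# Charset label -> Python codec name.
# This mapping is an assumption; the tool's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("^").items():
        print(label, bits, hex_string)
```

For the input `^`, every charset yields the single byte `5e` (binary `01011110`), which agrees with the final byte of each row in the table.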