Into what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揖?ぜ矣??熬??逾??柔k?沃 001111110011111100111111100101110100101100111111100000101011101011100001111000010011111100111111111000001001001000111111001111111110011110100101001111110011111110001111010111111000001010001011001111111001011110000000 3f3f3f974b3f82bae1e13f3fe0923f3fe7a53f3f8f5f828b3f9780
EUC-JP ???揖?ぜ矣??熬??逾??柔k?沃 001111110011111100111111110011011010110000111111101001001011110011100010111000110011111100111111110111111111001000111111001111111110111010100111001111110011111110111101110000001010001111101011001111111100110111100000 3f3f3fcdac3fa4bce2e33f3fdff23f3feea73f3fbdc0a3eb3fcde0
UTF-8 歷띰퐣揖띈ぜ矣뺤쭇熬곣뫁逾뤹춯柔k짋沃 111011111010011010001100111010111001110110110000111011011001000010100011111001101000111110010110111010111001110110001000111000111000000110011100111001111001111110100011111010111011101010100100111011001010110110000111111001111000011010101100111010101011001110100011111010111010101110000001111010011000000010111110111010111010010010111001111011001011011010101111111001101001111110010100111011111011110110001011111011001010011110001011111001101011001010000011 efa68ceb9db0ed90a3e68f96eb9d88e3819ce79fa3ebbaa4ecad87e786aceab3a3ebab81e980beeba4b9ecb6afe69f94efbd8beca78be6b283
UHC 歷띰퐣揖띈ぜ矣뺤쭇熬곣뫁逾뤹춯柔k짋沃 1110011010111000101101101110111110111101100011001110101111100111101101101110100010101010101111001110101111111000100101011110110010100111100000111110100010100010100000011110001010010001101001011110101110110101100011111110011110101101100011001110101011110101101000111110101110100011100101111110100010101010 e6b8b6efbd8cebe7b6e8aabcebf895eca783e8a281e291a5ebb58fe7ad8ceaf5a3eba397e8aa

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
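
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name encode_report is hypothetical, and the mapping of the page's labels to Python codec names (SJIS-WIN as cp932, EUC-JP as euc_jp, UHC as cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the table above.

```python
# Hypothetical helper: encode a string in several charsets and return
# the binary and hexadecimal bit strings, similar to the table above.

# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return a list of (charset, binary string, hex string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


for name, bits, hexstr in encode_report("ぜ"):
    print(name, bits, hexstr)
```

For the single character "ぜ", this should report 82ba for cp932, a4bc for euc_jp, and e3819c for utf-8, the same byte sequences that appear for that character in the rows above, while iso-8859-1 falls back to 3f.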