To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 荳茨スヲ逍セ訷皮煤}荳茨スヲ逍セ訷皮煤{^ 111001001011100010001000111011111011110110100110111001111001011010111110111110111010010010010100111001111001010010000001011111011110010010111000100010001110111110111101101001101110011110010110101111101111101110100100100101001110011110010100100000010111101101011110 e4b888efbda6e796befba494e794817de4b888efbda6e796befba494e794817b5e
EUC-JP 荳茨スヲ逍セ訷皮煤}荳茨スヲ逍セ訷皮煤{^ 1110100010111010101100001111000110001110101111011000111010100110111011011111011010001110101111101000111111011101110101001100100011101001110001111110000101111101111010001011101010110000111100011000111010111101100011101010011011101101111101101000111010111110100011111101110111010100110010001110100111000111111000010111101101011110 e8bab0f18ebd8ea6edf68ebe8fddd4c8e9c7e17de8bab0f18ebd8ea6edf68ebe8fddd4c8e9c7e17b5e
UTF-8 荳茨スヲ逍セ訷皮煤}荳茨スヲ逍セ訷皮煤{^ 111010001000110110110011111010001000110010101000111011111011110110111101111011111011110110100110111010011000000010001101111011111011110110111110111010001010100010110111111001111001101010101110111001111000010110100100011111011110100010001101101100111110100010001100101010001110111110111101101111011110111110111101101001101110100110000000100011011110111110111101101111101110100010101000101101111110011110011010101011101110011110000101101001000111101101011110 e88db3e88ca8efbdbdefbda6e9808defbdbee8a8b7e79aaee785a47de88db3e88ca8efbdbdefbda6e9808defbdbee8a8b7e79aaee785a47b5e
UHC 荳茨??逍??皮煤}荳茨??逍??皮煤{^ 11010100111001011110110110111100001111110011111111100001110011100011111100111111111110011010101111011000111000000111110111010100111001011110110110111100001111110011111111100001110011100011111100111111111110011010101111011000111000000111101101011110 d4e5edbc3f3fe1ce3f3ff9abd8e07dd4e5edbc3f3fe1ce3f3ff9abd8e07b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)