To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 荳茨スヲ踝セ訷疲豬 111001001011100010001000111011111011110110100110111001101111010010111110111110111010010010010100111001101110011010110101 e4b888efbda6e6f4befba494e6e6b5
EUC-JP 荳茨スヲ踝セ訷疲豬 11101000101110101011000011110001100011101011110110001110101001101110110011110110100011101011111010001111110111011101010011001000111010001110110010110111 e8bab0f18ebd8ea6ecf68ebe8fddd4c8e8ecb7
UTF-8 荳茨スヲ踝セ訷疲豬 111010001000110110110011111010001000110010101000111011111011110110111101111011111011110110100110111010001011100010011101111011111011110110111110111010001010100010110111111001111001011010110010111010001011000110101100 e88db3e88ca8efbdbdefbda6e8b89defbdbee8a8b7e796b2e8b1ac
UHC 荳茨?????疲? 110101001110010111101101101111000011111100111111001111110011111100111111111110011010101000111111 d4e5edbc3f3f3f3f3ff9aa3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)