To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 誧ァ郞誹刊郞韮 11111011101001101010011111111011101101101001010011101110100010101010011111111011101101101001010001000010 fba6a7fbb694ee8aa7fbb69442
EUC-JP 誧ァ?誹刊?韮 10001111110111011111101010001110101001110011111111001000111100001011010010101001001111111100011110100011 8fddfa8ea73fc8f0b4a93fc7a3
UTF-8 誧ァ郞誹刊郞韮 111010001010101010100111111011111011110110100111111010011000001110011110111010001010101010111001111001011000100010001010111010011000001110011110111010011001111110101110 e8aaa7efbda7e9839ee8aab9e5888ae9839ee99fae
UHC ??郞誹刊郞? 0011111100111111110101011100110111011110101001101100101011001010110101011100110100111111 3f3fd5cddea6cacad5cd3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)