To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????v????vB | 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 | 3f3f3f3f763f3f3f3f7642
SJIS-WIN | 裼亥、ァv裼亥、ァvB | 111001011110011110001000111001011010010010100111011101101110010111100111100010001110010110100100101001110111011001000010 | e5e788e5a4a776e5e788e5a4a77642
EUC-JP | 裼亥、ァv裼亥、ァvB | 11101010111010011011000011100111100011101010010010001110101001110111011011101010111010011011000011100111100011101010010010001110101001110111011001000010 | eae9b0e78ea48ea776eae9b0e78ea48ea77642
UTF-8 | 裼亥、ァv裼亥、ァvB | 111010001010001110111100111001001011101010100101111011111011110110100100111011111011110110100111011101101110100010100011101111001110010010111010101001011110111110111101101001001110111110111101101001110111011001000010 | e8a3bce4baa5efbda4efbda776e8a3bce4baa5efbda4efbda77642
UHC | ?亥??v?亥??vB | 00111111111110101010010000111111001111110111011000111111111110101010010000111111001111110111011001000010 | 3ffaa43f3f763ffaa43f3f7642
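
A table like this one can be approximated offline with a short Python sketch. This is not the converter's own code. The codec names are Python's, and two mappings are assumptions: SJIS-WIN to `cp932` and UHC to `cp949`. `errors="replace"` makes Python substitute "?" (3f) for characters a charset cannot represent, which appears to match the ISO-8859-1 and UHC rows. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Table label -> Python codec name. These mappings are assumptions:
# "cp932" stands in for SJIS-WIN and "cp949" for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Each byte is written as exactly eight bits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    # Decoding the bytes again shows what survived the round trip.
    return data.decode(codec), binary, data.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (_shown, binary, hexa) in encode_table("亥vB").items():
        print(label, binary, hexa)
```

The trailing bytes `76` and `42` (for "v" and "B") are the same in every row because all five charsets are ASCII-compatible for those characters.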

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).