To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????L???? 00111111001111110011111100111111001111110100110000111111001111110011111100111111 3f3f3f3f3f4c3f3f3f3f
SJIS-WIN 害???削L???蒡 10001010010100010011111100111111001111111000110111101101010011000011111100111111001111111110010011101110 8a513f3f3f8ded4c3f3f3fe4ee
EUC-JP 害???削L???蒡 10110011101100100011111100111111001111111011101011101111010011000011111100111111001111111110100011110000 b3b23f3f3fbaef4c3f3f3fe8f0
UTF-8 害렟뤉롭削L렟뤉롭蒡 11100101101011101011001111101011101000001001111111101011101001001000100111101011101000011010110111100101100010011000101001001100111010111010000010011111111010111010010010001001111010111010000110101101111010001001001010100001 e5aeb3eba09feba489eba1ade5898a4ceba09feba489eba1ade892a1
UHC 害렟뤉롭削L렟뤉롭蒡 11111010101010101000111010110000100011111011100110110111110100111101111011111011010011001000111010110000100011111011100110110111110100111101101110111100 faaa8eb08fb9b7d3defb4c8eb08fb9b7d3dbbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)