To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????e?????????e^ 001111110011111100111111001111110011111100111111001111110011111100111111011001010011111100111111001111110011111100111111001111110011111100111111001111110110010101011110 3f3f3f3f3f3f3f3f3f653f3f3f3f3f3f3f3f3f655e
SJIS-WIN ???歪??軟??e???歪??軟??e^ 00111111001111110011111110011000011000110011111100111111100100111110111000111111001111110110010100111111001111110011111110011000011000110011111100111111100100111110111000111111001111110110010101011110 3f3f3f98633f3f93ee3f3f653f3f3f98633f3f93ee3f3f655e
EUC-JP ???歪??軟??e???歪??軟??e^ 00111111001111110011111111001111110001000011111100111111110001101111000000111111001111110110010100111111001111110011111111001111110001000011111100111111110001101111000000111111001111110110010101011110 3f3f3fcfc43f3fc6f03f3f653f3f3fcfc43f3fc6f03f3f655e
UTF-8 亮몃젒歪귝뼴軟쒕젍e亮몃젒歪귝뼴軟쒕젍e^ 111011111010010110110111111010111010101010000011111011001010000010010010111001101010110110101010111010101011011110011101111010111011110010110100111010001011101110011111111011001001001010010101111011001010000010001101011001011110111110100101101101111110101110101010100000111110110010100000100100101110011010101101101010101110101010110111100111011110101110111100101101001110100010111011100111111110110010010010100101011110110010100000100011010110010101011110 efa5b7ebaa83eca092e6adaaeab79debbcb4e8bb9fec9295eca08d65efa5b7ebaa83eca092e6adaaeab79debbcb4e8bb9fec9295eca08d655e
UHC 亮몃젒歪귝뼴軟쒕젍e亮몃젒歪귝뼴軟쒕젍e^ 111001011011100110111000111010111010000010010001111010001110000010000010111001101001011010110111111001101110001110011100111010111010000010001110011001011110010110111001101110001110101110100000100100011110100011100000100000101110011010010110101101111110011011100011100111001110101110100000100011100110010101011110 e5b9b8eba091e8e082e696b7e6e39ceba08e65e5b9b8eba091e8e082e696b7e6e39ceba08e655e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)