What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 汁???姐????源烽???源烽???應?B 1000111101100000001111110011111100111111100010001011011100111111001111110011111100111111100011001011100111100000100000100011111100111111001111111000110010111001111000001000001000111111001111110011111110011100111001000011111101000010 8f603f3f3f88b73f3f3f3f8cb9e0823f3f3f8cb9e0823f3f3f9ce43f42
EUC-JP 汁???姐????源烽???源烽???應?B 1011110111000001001111110011111100111111101100001011100100111111001111110011111100111111101110001011101111011111111000100011111100111111001111111011100010111011110111111110001000111111001111110011111111011000111001100011111101000010 bdc13f3f3fb0b93f3f3f3fb8bbdfe23f3f3fb8bbdfe23f3f3fd8e63f42
UTF-8 汁흗렓렜姐븍웃渽렜源烽웃渽렜源烽웃渽렜應렱B 11100110101100011000000111101101100111011001011111101011101000001001001111101011101000001001110011100101101001111001000011101011101110001000110111101100100110111000001111100110101110001011110111101011101000001001110011100110101110101001000011100111100000111011110111101100100110111000001111100110101110001011110111101011101000001001110011100110101110101001000011100111100000111011110111101100100110111000001111100110101110001011110111101011101000001001110011100110100001111000100111101011101000001011000101000010 e6b181ed9d97eba093eba09ce5a790ebb88dec9b83e6b8bdeba09ce6ba90e783bdec9b83e6b8bdeba09ce6ba90e783bdec9b83e6b8bdeba09ce68789eba0b142
UHC 汁흗렓렜姐븍웃渽렜源烽웃渽렜源烽웃渽렜應렱B 11110001111100001100100011101001100011101010100010001110101011101110111010111011101110101110101110111111111101001110111010101010100011101010111011101010101110011101110011101011101111111111010011101110101010101000111010101110111010101011100111011100111010111011111111110100111011101010101010001110101011101110101111101011100011101011111001000010 f1f0c8e98ea88eaeeebbbaebbff4eeaa8eaeeab9dcebbff4eeaa8eaeeab9dcebbff4eeaa8eaeebeb8ebe42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
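
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the converter behind this page: the helper name `to_bit_strings` is hypothetical, and the mapping of this page's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table above.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Hypothetical helper for illustration; charset is a Python codec name.
    """
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    data = text.encode(charset, errors="replace")
    # Render each byte as 8 binary digits, most significant bit first
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed codec names for the charsets listed in the table
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "汁B"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For the input "汁B", the UTF-8 row would read e6b18142 in hexadecimal, and the ISO-8859-1 row would read 3f42 because 汁 has no ISO-8859-1 representation.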