To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 瘟??熬?オ鸚??D瘟??熬?オ鸚??D^ 1110000110001001001111110011111111100000100100100011111110000011010010011110101001011111001111110011111101000100111000011000100100111111001111111110000010010010001111111000001101001001111010100101111100111111001111110100010001011110 e1893f3fe0923f8349ea5f3f3f44e1893f3fe0923f8349ea5f3f3f445e
EUC-JP 瘟??熬?オ鸚??D瘟??熬?オ鸚??D^ 1110000111101001001111110011111111011111111100100011111110100101101010101111001111000000001111110011111101000100111000011110100100111111001111111101111111110010001111111010010110101010111100111100000000111111001111110100010001011110 e1e93f3fdff23fa5aaf3c03f3f44e1e93f3fdff23fa5aaf3c03f3f445e
UTF-8 瘟룩큹熬뽫オ鸚싧콪D瘟룩큹熬뽫オ鸚싧콪D^ 111001111001100010011111111010111010001110101001111011011000000110111001111001111000011010101100111010111011110110101011111000111000001010101010111010011011100010011010111011001000101110100111111011001011110110101010010001001110011110011000100111111110101110100011101010011110110110000001101110011110011110000110101011001110101110111101101010111110001110000010101010101110100110111000100110101110110010001011101001111110110010111101101010100100010001011110 e7989feba3a9ed81b9e786acebbdabe382aae9b89aec8ba7ecbdaa44e7989feba3a9ed81b9e786acebbdabe382aae9b89aec8ba7ecbdaa445e
UHC 瘟룩큹熬뽫オ鸚싧콪D瘟룩큹熬뽫オ鸚싧콪D^ 111010001011000010110111111010001011010010001000111010001010001010010110111001111010101110101010111001011010010010011010111001011011000110011110010001001110100010110000101101111110100010110100100010001110100010100010100101101110011110101011101010101110010110100100100110101110010110110001100111100100010001011110 e8b0b7e8b488e8a296e7abaae5a49ae5b19e44e8b0b7e8b488e8a296e7abaae5a49ae5b19e445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)