To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???^h???^fN}???^h???^fN{^ 00111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111110100111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111101101011110 3f3f3f5e683f3f3f5e664e7d3f3f3f5e683f3f3f5e664e7b5e
SJIS-WIN ?癌殃^h?癌殃^fN}?癌殃^h?癌殃^fN{^ 001111111000101011100000100111110110100101011110011010000011111110001010111000001001111101101001010111100110011001001110011111010011111110001010111000001001111101101001010111100110100000111111100010101110000010011111011010010101111001100110010011100111101101011110 3f8ae09f695e683f8ae09f695e664e7d3f8ae09f695e683f8ae09f695e664e7b5e
EUC-JP ?癌殃^h?癌殃^fN}?癌殃^h?癌殃^fN{^ 001111111011010011100010110111011100101001011110011010000011111110110100111000101101110111001010010111100110011001001110011111010011111110110100111000101101110111001010010111100110100000111111101101001110001011011101110010100101111001100110010011100111101101011110 3fb4e2ddca5e683fb4e2ddca5e664e7d3fb4e2ddca5e683fb4e2ddca5e664e7b5e
UTF-8 卨癌殃^h卨癌殃^fN}卨癌殃^h卨癌殃^fN{^ 11100101100011011010100011100111100110011000110011100110101011101000001101011110011010001110010110001101101010001110011110011001100011001110011010101110100000110101111001100110010011100111110111100101100011011010100011100111100110011000110011100110101011101000001101011110011010001110010110001101101010001110011110011001100011001110011010101110100000110101111001100110010011100111101101011110 e58da8e7998ce6ae835e68e58da8e7998ce6ae835e664e7de58da8e7998ce6ae835e68e58da8e7998ce6ae835e664e7b5e
UHC 卨癌殃^h卨癌殃^fN}卨癌殃^h卨癌殃^fN{^ 11100000110110011110010011011111111001001110101001011110011010001110000011011001111001001101111111100100111010100101111001100110010011100111110111100000110110011110010011011111111001001110101001011110011010001110000011011001111001001101111111100100111010100101111001100110010011100111101101011110 e0d9e4dfe4ea5e68e0d9e4dfe4ea5e664e7de0d9e4dfe4ea5e68e0d9e4dfe4ea5e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)