To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???nkf???nk^}Y???nkf???nk^}bE 0011111100111111001111110110111001101011011001100011111100111111001111110110111001101011010111100111110101011001001111110011111100111111011011100110101101100110001111110011111100111111011011100110101101011110011111010110001001000101 3f3f3f6e6b663f3f3f6e6b5e7d593f3f3f6e6b663f3f3f6e6b5e7d6245
SJIS-WIN 惺殲宣nkf惺殲宣nk^}Y惺殲宣nkf惺殲宣nk^}bE 1001110010110111100111110111001010010000111010010110111001101011011001101001110010110111100111110111001010010000111010010110111001101011010111100111110101011001100111001011011110011111011100101001000011101001011011100110101101100110100111001011011110011111011100101001000011101001011011100110101101011110011111010110001001000101 9cb79f7290e96e6b669cb79f7290e96e6b5e7d599cb79f7290e96e6b669cb79f7290e96e6b5e7d6245
EUC-JP 惺殲宣nkf惺殲宣nk^}Y惺殲宣nkf惺殲宣nk^}bE 1101100010111001110111011101001111000000111010110110111001101011011001101101100010111001110111011101001111000000111010110110111001101011010111100111110101011001110110001011100111011101110100111100000011101011011011100110101101100110110110001011100111011101110100111100000011101011011011100110101101011110011111010110001001000101 d8b9ddd3c0eb6e6b66d8b9ddd3c0eb6e6b5e7d59d8b9ddd3c0eb6e6b66d8b9ddd3c0eb6e6b5e7d6245
UTF-8 惺殲宣nkf惺殲宣nk^}Y惺殲宣nkf惺殲宣nk^}bE 1110011010000011101110101110011010101110101100101110010110101110101000110110111001101011011001101110011010000011101110101110011010101110101100101110010110101110101000110110111001101011010111100111110101011001111001101000001110111010111001101010111010110010111001011010111010100011011011100110101101100110111001101000001110111010111001101010111010110010111001011010111010100011011011100110101101011110011111010110001001000101 e683bae6aeb2e5aea36e6b66e683bae6aeb2e5aea36e6b5e7d59e683bae6aeb2e5aea36e6b66e683bae6aeb2e5aea36e6b5e7d6245
UHC 惺殲宣nkf惺殲宣nk^}Y惺殲宣nkf惺殲宣nk^}bE 1110000011110110111000001110100011100000101111100110111001101011011001101110000011110110111000001110100011100000101111100110111001101011010111100111110101011001111000001111011011100000111010001110000010111110011011100110101101100110111000001111011011100000111010001110000010111110011011100110101101011110011111010110001001000101 e0f6e0e8e0be6e6b66e0f6e0e8e0be6e6b5e7d59e0f6e0e8e0be6e6b66e0f6e0e8e0be6e6b5e7d6245

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
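
The conversion shown in the table can be approximated with a short Python sketch. This is not necessarily how this page computes its results. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper `to_bit_strings` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("惺", codec)
        print(label, binary, hexadecimal)
```

Multi-byte charsets yield two or three bytes for each kanji, while the ASCII letters in the input encode to the same single byte in every charset listed.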