To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nkf???nk^}Y???nkf???nk^}bE 0011111100111111001111110110111001101011011001100011111100111111001111110110111001101011010111100111110101011001001111110011111100111111011011100110101101100110001111110011111100111111011011100110101101011110011111010110001001000101 3f3f3f6e6b663f3f3f6e6b5e7d593f3f3f6e6b663f3f3f6e6b5e7d6245
SJIS-WIN 惺淳宣nkf惺淳宣nk^}Y惺淳宣nkf惺淳宣nk^}bE 1001110010110111100011110111111010010000111010010110111001101011011001101001110010110111100011110111111010010000111010010110111001101011010111100111110101011001100111001011011110001111011111101001000011101001011011100110101101100110100111001011011110001111011111101001000011101001011011100110101101011110011111010110001001000101 9cb78f7e90e96e6b669cb78f7e90e96e6b5e7d599cb78f7e90e96e6b669cb78f7e90e96e6b5e7d6245
EUC-JP 惺淳宣nkf惺淳宣nk^}Y惺淳宣nkf惺淳宣nk^}bE 1101100010111001101111011101111111000000111010110110111001101011011001101101100010111001101111011101111111000000111010110110111001101011010111100111110101011001110110001011100110111101110111111100000011101011011011100110101101100110110110001011100110111101110111111100000011101011011011100110101101011110011111010110001001000101 d8b9bddfc0eb6e6b66d8b9bddfc0eb6e6b5e7d59d8b9bddfc0eb6e6b66d8b9bddfc0eb6e6b5e7d6245
UTF-8 惺淳宣nkf惺淳宣nk^}Y惺淳宣nkf惺淳宣nk^}bE 1110011010000011101110101110011010110111101100111110010110101110101000110110111001101011011001101110011010000011101110101110011010110111101100111110010110101110101000110110111001101011010111100111110101011001111001101000001110111010111001101011011110110011111001011010111010100011011011100110101101100110111001101000001110111010111001101011011110110011111001011010111010100011011011100110101101011110011111010110001001000101 e683bae6b7b3e5aea36e6b66e683bae6b7b3e5aea36e6b5e7d59e683bae6b7b3e5aea36e6b66e683bae6b7b3e5aea36e6b5e7d6245
UHC 惺淳宣nkf惺淳宣nk^}Y惺淳宣nkf惺淳宣nk^}bE 1110000011110110111000101110100011100000101111100110111001101011011001101110000011110110111000101110100011100000101111100110111001101011010111100111110101011001111000001111011011100010111010001110000010111110011011100110101101100110111000001111011011100010111010001110000010111110011011100110101101011110011111010110001001000101 e0f6e2e8e0be6e6b66e0f6e2e8e0be6e6b5e7d59e0f6e2e8e0be6e6b66e0f6e2e8e0be6e6b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)