To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 源?障?垣企???紆?梯?畯脈?梯?趙孟∧ 100011001011100100111111100011111110000100111111100010100101111110001010111010010011111100111111001111111110001011111100001111111001001011110010001111111111101101101111100101101010110000111111100100101111001000111111111001101110001010010110110100001000000111001000 8cb93f8fe13f8a5f8ae93f3f3fe2fc3f92f23ffb6f96ac3f92f23fe6e296d081c8
EUC-JP 源?障?垣企?塼?紆?梯?畯脈?梯?趙孟∧ 101110001011101100111111101111101110001100111111101100111100000010110100111010110011111110001111101110001011100100111111111001001111111000111111110001001111010000111111100011111100110110111011110011001010111000111111110001001111010000111111111011001110010011001100110100101010001011001010 b8bb3fbee33fb3c0b4eb3f8fb8b93fe4fe3fc4f43f8fcdbbccae3fc4f43fece4ccd2a2ca
UTF-8 源렰障렚垣企㉢塼렦紆렣梯렟畯脈歷梯렟趙孟∧ 111001101011101010010000111010111010000010110000111010011001101010011100111010111010000010011010111001011001111010100011111001001011110010000001111000111000100110100010111001011010000110111100111010111010000010100110111001111011010010000110111010111010000010100011111001101010001010101111111010111010000010011111111001111001010110101111111010001000010010001000111011111010011010001100111001101010001010101111111010111010000010011111111010001011011010011001111001011010110110011111111000101000100010100111 e6ba90eba0b0e99a9ceba09ae59ea3e4bc81e389a2e5a1bceba0a6e7b486eba0a3e6a2afeba09fe795afe88488efa68ce6a2afeba09fe8b699e5ad9fe288a7
UHC 源렰障렚垣企㉢塼렦紆렣梯렟畯脈歷梯렟趙孟∧ 111010101011100110001110101111011110111010100001100011101010110111101010101011111101000011101010101010001011001111101110111101001000111010110101111010011110000110001110101101001111000010101100100011101011000011110001111000011101100011100110111001101011100011110000101011001000111010110000111100001110000111011000111010111010000111111100 eab98ebdeea18eadeaafd0eaa8b3eef48eb5e9e18eb4f0ac8eb0f1e1d8e6e6b8f0ac8eb0f0e1d8eba1fc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)