To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋よ?娃??耶??鵝???ц?泳??瘟?? 1001101011001000100000101110011000111111100010001010000100111111001111111001011011101011001111110011111111101010010000000011111100111111001111111000010010001000001111111000100101101010001111110011111111100001100010010011111100111111 9ac882e63f88a13f3f96eb3f3fea403f3f3f84883f896a3f3fe1893f3f
EUC-JP 塋よ?娃??耶??鵝???ц?泳??瘟?? 1101010011001010101001001110100000111111101100001010001100111111001111111100110011101101001111110011111111110011101000010011111100111111001111111010011111101000001111111011000111001011001111110011111111100001111010010011111100111111 d4caa4e83fb0a33f3fcced3f3ff3a13f3f3fa7e83fb1cb3f3fe1e93f3f
UTF-8 塋よ떨娃쒏짎耶섇춼鵝녶렘歷ц꽦泳싩텥瘟룟렘 1110010110100001100010111110001110000010100010001110101110010110101010001110010110101000100000111110110010010010100011111110110010100111100011101110100010000000101101101110110010000100100001111110110010110110101111001110100110110101100111011110101110000101101101101110101110100000100110001110111110100110100011001101000110000110111010101011110110100110111001101011001110110011111011001000101110101001111011011000010110100101111001111001100010011111111010111010001110011111111010111010000010011000 e5a18be38288eb96a8e5a883ec928feca78ee880b6ec8487ecb6bce9b59deb85b6eba098efa68cd186eabda6e6b3b3ec8ba9ed85a5e7989feba39feba098
UHC 塋よ떨娃쒏짎耶섇춼鵝녶렘歷ц꽦泳싩텥瘟룟렘 111001111010101110101010111010001011011010110011111010001101111110011100111001101010001110011010111001011010110110011000111001011010110110011000111001001011110110000110111001011011011110111101111001101011100010101100111010001000010010110001111001111011011010011010111001111011011010011010111010001011000010110111111001011011011110111101 e7abaae8b6b3e8df9ce6a39ae5ad98e5ad98e4bd86e5b7bde6b8ace884b1e7b69ae7b69ae8b0b7e5b7bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)