To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??揖??魏??筌??愿??音??鸚?? 11101001011001100011111100111111100101110100101100111111001111111110100110110000001111110011111111100010101000110011111100111111100111001100001100111111001111111000100110111001001111110011111111101010010111110011111100111111 e9663f3f974b3f3fe9b03f3fe2a33f3f9cc33f3f89b93f3fea5f3f3f
EUC-JP 馭??揖??魏??筌??愿??音??鸚?? 11110001110001110011111100111111110011011010110000111111001111111111001010110010001111110011111111100100101001010011111100111111110110001100010100111111001111111011001010111011001111110011111111110011110000000011111100111111 f1c73f3fcdac3f3ff2b23f3fe4a53f3fd8c53f3fb2bb3f3ff3c03f3f
UTF-8 馭곥룊揖뀐㎖魏놁숯筌욎쥙愿묕쭪音녹궒鸚룸쁻 111010011010011010101101111010101011001110100101111010111010001110001010111001101000111110010110111010111000000010010000111000111000111010010110111010011010110110001111111010111000011010000001111011001000100010101111111001111010110110001100111011001001101010001110111011001010010110011001111001101000010010111111111010111010110010010101111011001010110110101010111010011001111110110011111010111000010110111001111010101011011010010010111010011011100010011010111010111010001110111000111011001000000110111011 e9a6adeab3a5eba38ae68f96eb8090e38e96e9ad8feb8681ec88afe7ad8cec9a8eeca599e684bfebac95ecadaae99fb3eb85b9eab692e9b89aeba3b8ec81bb
UHC 馭곥룊揖뀐㎖魏놁숯筌욎쥙愿묕쭪音녹궒鸚룸쁻 111001011101111110000001111000111000111110001001111010111110011110110010111011111010011110100010111010101110000010000110111011001011110110100001111011111010011110011110111011001010001010001110111010101011010010010001111011111010011110011110111010111110010110110011111011001000001010100111111001011010010010110111111010111001100010000010 e5df81e38f89ebe7b2efa7a2eae086ecbda1efa79eeca28eeab491efa79eebe5b3ec82a7e5a4b7eb9882

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)