To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 儼???嚥????????恝泣ζ?純??? 100110010101011000111111001111110011111110011010100010110011111100111111001111110011111100111111001111110011111100111111111110101011110010001011100000111000001111000100001111111000111110000011001111110011111100111111 99563f3f3f9a8b3f3f3f3f3f3f3f3ffabc8b8383c43f8f833f3f3f
EUC-JP 儼???嚥??彛??絪??恝泣ζ?純??? 1101000110110111001111110011111100111111110100111110101100111111001111111000111110111100111110100011111100111111100011111101001111101100001111110011111110001111101111011110011110110101111000111010011011000110001111111011110111100011001111110011111100111111 d1b73f3f3fd3eb3f3f8fbcfa3f3f8fd3ec3f3f8fbde7b5e3a6c63fbde33f3f3f
UTF-8 儼벿우뒫嚥싥룤彛띸솈絪싨뮄恝泣ζ퓴純앹춱捻 1110010110000100101111001110101110110010101111111110110010011010101100001110101110010010101010111110010110011010101001011110110010001011101001011110101110100011101001001110010110111101100110111110101110011101101110001110110010000110100010001110011110110101101010101110110010001011101010001110101110101110100001001110011010000001100111011110011010110011101000111100111010110110111011011001001110110100111001111011010010010100111011001001010110111001111011001011011010110001111011111010011010100100 e584bcebb2bfec9ab0eb92abe59aa5ec8ba5eba3a4e5bd9beb9db8ec8688e7b5aaec8ba8ebae84e6819de6b3a3ceb6ed93b4e7b494ec95b9ecb6b1efa6a4
UHC 儼벿우뒫嚥싥룤彛띸솈絪싨뮄恝泣ζ퓴純앹춱捻 111001011111000010010011110011101011111111101100100010101010010111100110101111111001101011100011100011111001110111101100101011011000110111100111100110011000110011101100110111111001101011100110100100101001001111001110101111111110101111101000101001011110011010111111100110101110001011101101100111011110110010101101100011011110011011110111 e5f093cebfec8aa5e6bf9ae38f9decad8de7998cecdf9ae69293cebfebe8a5e6bf9ae2ed9decad8de6f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)