To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??援??揄??嚴щ?釗??邑??筌 111000011001111100111111001111111000100110000111001111110011111110011101100010010011111100111111100110101000111010000100100010110011111111111011101110110011111100111111100101110101011100111111001111111110001010100011 e19f3f3f89873f3f9d893f3f9a8e848b3ffbbb3f3f97573f3fe2a3
EUC-JP 癲??援??揄??嚴щ?釗??邑??筌 11100010101000010011111100111111101100011110011100111111001111111101100111101001001111110011111111010011111011101010011111101011001111111000111111100011101001100011111100111111110011011011100000111111001111111110010010100101 e2a13f3fb1e73f3fd9e93f3fd3eea7eb3f8fe3a63f3fcdb83f3fe4a5
UTF-8 癲삳뀍援잒짆揄몄쵁嚴щ쵐釗볡솾邑노펶筌 1110011110011001101100101110110010000010101100111110101110000000100011011110011010001111101101001110110010011110100100101110110010100111100001101110011010001111100001001110101110101010100001001110110010110101100000011110010110011010101101001101000110001001111011001011010110010000111010011000011110010111111010111011001110100001111011001000011010111110111010011000001010010001111010111000010110111000111011011000111010110110111001111010110110001100 e799b2ec82b3eb808de68fb4ec9e92eca786e68f84ebaa84ecb581e59ab4d189ecb590e98797ebb3a1ec86bee98291eb85b8ed8eb6e7ad8c
UHC 癲삳뀍援잒짆揄몄쵁嚴щ쵐釗볡솾邑노펶筌 1110111110100110101110111110101110000101100010001110101010110101100111111110100010100011100101011110101011110001101110001110110010101100100000111110010111110001101011001110101110101100100100101110000111110010100100111110011110011001101100101110101111101001101100111110101110111100100001111110111110100111 efa6bbeb8588eab59fe8a395eaf1b8ecac83e5f1acebac92e1f293e799b2ebe9b3ebbc87efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)