What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 上ヨ鴫竺湿宍邪酌上ヨ鴫竺湿宍邪灼^ 100011111110001111010110100011101011000011110011111011101000111010110001100011101011110011110001111011101000111010110011100011101101011110001110110111101000111111100011110101101000111010110000111100111110111010001110101100011000111010111100111100011110111010001110101100111000111011010111100011101101110001011110 8fe3d68eb0f3ee8eb18ebcf1ee8eb38ed78ede8fe3d68eb0f3ee8eb18ebcf1ee8eb38ed78edc5e
EUC-JP 上ヨ鴫?竺湿?宍邪酌上ヨ鴫?竺湿?宍邪灼^ 10111110111001011000111011010110101111001011001000111111101111001011001110111100101111100011111110111100101101011011110011011001101111001110000010111110111001011000111011010110101111001011001000111111101111001011001110111100101111100011111110111100101101011011110011011001101111001101111001011110 bee58ed6bcb23fbcb3bcbe3fbcb5bcd9bce0bee58ed6bcb23fbcb3bcbe3fbcb5bcd9bcde5e
UTF-8 上ヨ鴫竺湿宍邪酌上ヨ鴫竺湿宍邪灼^ 11100100101110001000101011101111101111101001011011101001101101001010101111101110100010111010000111100111101010111011101011100110101110011011111111101110100001011010100111100101101011101000110111101001100000101010101011101001100001011000110011100100101110001000101011101111101111101001011011101001101101001010101111101110100010111010000111100111101010111011101011100110101110011011111111101110100001011010100111100101101011101000110111101001100000101010101011100111100000011011110001011110 e4b88aefbe96e9b4abee8ba1e7abbae6b9bfee85a9e5ae8de982aae9858ce4b88aefbe96e9b4abee8ba1e7abbae6b9bfee85a9e5ae8de982aae781bc5e
UHC 上???竺???邪酌上???竺???邪灼^ 1101111110111110001111110011111100111111111101011110011100111111001111110011111111011110111101111110110111001100110111111011111000111111001111110011111111110101111001110011111100111111001111111101111011110111111011011100011101011110 dfbe3f3f3ff5e73f3f3fdef7edccdfbe3f3f3ff5e73f3f3fdef7edc75e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
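
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not the code behind this page. The helper names `encode_bits` and `convert` are hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumed to be the nearest equivalents of the charsets listed above, so individual byte values may differ from this page for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `?` entries in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# These pairings are an approximation, not taken from the page itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexadecimal) in convert("上^").items():
        print(label, bits, hexadecimal)
```

For example, `encode_bits("^", "utf-8")` returns `("01011110", "5e")`, which matches the final byte of every row in the table.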