To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 迺ス閾ェ豐サ驟圭n}迺ス閾ェ豐サ驟圭n{^ 11100111100100101011110111101000100001111010101011100110101100101011101111101001100001011000110001011100011011100111110111100111100100101011110111101000100001111010101011100110101100101011101111101001100001011000110001011100011011100111101101011110 e792bde887aae6b2bbe9858c5c6e7de792bde887aae6b2bbe9858c5c6e7b5e
EUC-JP 迺ス閾ェ豐サ驟圭n}迺ス閾ェ豐サ驟圭n{^ 11101101111100101000111010111101111011111110011110001110101010101110110010110100100011101011101111110001111001011011011110111101011011100111110111101101111100101000111010111101111011111110011110001110101010101110110010110100100011101011101111110001111001011011011110111101011011100111101101011110 edf28ebdefe78eaaecb48ebbf1e5b7bd6e7dedf28ebdefe78eaaecb48ebbf1e5b7bd6e7b5e
UTF-8 迺ス閾ェ豐サ驟圭n}迺ス閾ェ豐サ驟圭n{^ 1110100010111111101110101110111110111101101111011110100110010110101111101110111110111101101010101110100010110001100100001110111110111101101110111110100110101001100111111110010110011100101011010110111001111101111010001011111110111010111011111011110110111101111010011001011010111110111011111011110110101010111010001011000110010000111011111011110110111011111010011010100110011111111001011001110010101101011011100111101101011110 e8bfbaefbdbde996beefbdaae8b190efbdbbe9a99fe59cad6e7de8bfbaefbdbde996beefbdaae8b190efbdbbe9a99fe59cad6e7b5e
UHC ??????驟圭n}??????驟圭n{^ 00111111001111110011111100111111001111110011111111110110101011101101000010100100011011100111110100111111001111110011111100111111001111110011111111110110101011101101000010100100011011100111101101011110 3f3f3f3f3f3ff6aed0a46e7d3f3f3f3f3f3ff6aed0a46e7b5e
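The conversion shown in the table can be approximated with a short Python sketch. This is only an illustration under stated assumptions, not the converter's actual implementation: the helper name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the Python codec names chosen for each label (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed equivalents. Characters a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` runs in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The codec chosen for each label is an assumption, not the tool's own setup.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (byte 0x3f).
        raw = text.encode(codec, errors="replace")
        # Render every byte as exactly eight binary digits.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in to_bit_strings("圭").items():
        print(label, binary, hexa)
```

For the single character 圭, this sketch yields `e59cad` in UTF-8, `8c5c` in SJIS-WIN, and `b7bd` in EUC-JP, which agrees with the corresponding byte groups in the table. Results for vendor-specific characters may differ between Python's codecs and the converter.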

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).