To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 譚楢ウェ迹カ迺ス螳、譚楢ウェ迹カ迺ス螳、^ 11100110100111011001001111101000101100111010101011100111100100011011011011100111100100101011110111100101101011101010010011100110100111011001001111101000101100111010101011100111100100011011011011100111100100101011110111100101101011101010010001011110 e69d93e8b3aae791b6e792bde5aea4e69d93e8b3aae791b6e792bde5aea45e
EUC-JP 譚楢ウェ迹カ迺ス螳、譚楢ウェ迹カ迺ス螳、^ 1110101111111101110001101110101010001110101100111000111010101010111011011111000110001110101101101110110111110010100011101011110111101010101100001000111010100100111010111111110111000110111010101000111010110011100011101010101011101101111100011000111010110110111011011111001010001110101111011110101010110000100011101010010001011110 ebfdc6ea8eb38eaaedf18eb6edf28ebdeab08ea4ebfdc6ea8eb38eaaedf18eb6edf28ebdeab08ea45e
UTF-8 譚楢ウェ迹カ迺ス螳、譚楢ウェ迹カ迺ス螳、^ 11101000101011011001101011100110101001011010001011101111101111011011001111101111101111011010101011101000101111111011100111101111101111011011011011101000101111111011101011101111101111011011110111101000100111101011001111101111101111011010010011101000101011011001101011100110101001011010001011101111101111011011001111101111101111011010101011101000101111111011100111101111101111011011011011101000101111111011101011101111101111011011110111101000100111101011001111101111101111011010010001011110 e8ad9ae6a5a2efbdb3efbdaae8bfb9efbdb6e8bfbaefbdbde89eb3efbda4e8ad9ae6a5a2efbdb3efbdaae8bfb9efbdb6e8bfbaefbdbde89eb3efbda45e
UHC 譚楢??迹???螳?譚楢??迹???螳?^ 1101001111001001111010101111100100111111001111111110111011101001001111110011111100111111110100111101100100111111110100111100100111101010111110010011111100111111111011101110100100111111001111110011111111010011110110010011111101011110 d3c9eaf93f3feee93f3f3fd3d93fd3c9eaf93f3feee93f3f3fd3d93f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)