To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???唯??源??嚴ъ????揄??裔??純 0011111100111111001111111001011101000010001111110011111110001100101110010011111100111111100110101000111010000100100011000011111100111111001111110011111110011101100010010011111100111111111001011110000100111111001111111000111110000011 3f3f3f97423f3f8cb93f3f9a8e848c3f3f3f3f9d893f3fe5e13f3f8f83
EUC-JP ???唯??源??嚴ъ?佾??揄??裔??純 00111111001111110011111111001101101000110011111100111111101110001011101100111111001111111101001111101110101001111110110000111111100011111011000011111011001111110011111111011001111010010011111100111111111010101110001100111111001111111011110111100011 3f3f3fcda33f3fb8bb3f3fd3eea7ec3f8fb0fb3f3fd9e93f3feae33f3fbde3
UTF-8 嶺뚢돦唯쎿룚源띲럶嚴ъ쥒佾띸춯揄먮즰裔꾩빢純 1110111110100110101010111110101110011010101000101110101110001111101001101110010110010100101011111110110010001110101111111110101110100011100110101110011010111010100100001110101110011101101100101110101110011111101101101110010110011010101101001101000110001010111011001010010110010010111001001011110110111110111010111001110110111000111011001011011010101111111001101000111110000100111010111010100010101110111011001010011010110000111010001010001110010100111010101011111010101001111010111011100110100010111001111011010010010100 efa6abeb9aa2eb8fa6e594afec8ebfeba39ae6ba90eb9db2eb9fb6e59ab4d18aeca592e4bdbeeb9db8ecb6afe68f84eba8aeeca6b0e8a394eabea9ebb9a2e7b494
UHC 嶺뚢돦唯쎿룚源띲럶嚴ъ쥒佾띸춯揄먮즰裔꾩빢純 1110011110101101100011001110001010001001101010101110101011100110100110111110011010001111100101101110101010111001100011011110001110001110100101011110010111110001101011001110110010100010100010011110110011101011100011011110011110101101100011001110101011110001100100001110101110100011100000101110011111100000100001001110110010010101101111101110001011101101 e7ad8ce289aaeae69be68f96eab98de38e95e5f1aceca289eceb8de7ad8ceaf190eba382e7e084ec95bee2ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)