To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堤?町?辛?障豆?麥??町?辛?障豆?龍 10010010111001110011111110010010101011000011111110010000011010000011111110001111111000011001001110100100001111111110101001101101001111110011111110010010101011000011111110010000011010000011111110001111111000011001001110100100001111111001011110110100 92e73f92ac3f90683f8fe193a43fea6d3f3f92ac3f90683f8fe193a43f97b4
EUC-JP 堤?町?辛?障豆?麥??町?辛?障豆?龍 11000100111010010011111111000100101011100011111110111111110010010011111110111110111000111100011010100110001111111111001111001110001111110011111111000100101011100011111110111111110010010011111110111110111000111100011010100110001111111100111010110110 c4e93fc4ae3fbfc93fbee3c6a63ff3ce3f3fc4ae3fbfc93fbee3c6a63fceb6
UTF-8 堤렚町렑辛렮障豆렲麥렏렚町렑辛렮障豆렲龍 111001011010000010100100111010111010000010011010111001111001010010111010111010111010000010010001111010001011111010011011111010111010000010101110111010011001101010011100111010001011000110000110111010111010000010110010111010011011101010100101111010111010000010001111111010111010000010011010111001111001010010111010111010111010000010010001111010001011111010011011111010111010000010101110111010011001101010011100111010001011000110000110111010111010000010110010111010011011111010001101 e5a0a4eba09ae794baeba091e8be9beba0aee99a9ce8b186eba0b2e9baa5eba08feba09ae794baeba091e8be9beba0aee99a9ce8b186eba0b2e9be8d
UHC 堤렚町렑辛렮障豆렲麥렏렚町렑辛렮障豆렲龍 11110000101001111000111010101101111011111110101110001110101001101110001111110100100011101011101111101110101000011101010011100111100011101011111111011000111010101000111010100101100011101010110111101111111010111000111010100110111000111111010010001110101110111110111010100001110101001110011110001110101111111101011110100011 f0a78eadefeb8ea6e3f48ebbeea1d4e78ebfd8ea8ea58eadefeb8ea6e3f48ebbeea1d4e78ebfd7a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)