To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???徇??誘?η??l????異⑨?? 0011111100111111001111111001110001101101001111110011111110010111010101010011111110000011110001010011111100111111100000101000110000111111001111110011111100111111100010001101100110000111010010000011111100111111 3f3f3f9c6d3f3f97553f83c53f3f828c3f3f3f3f88d987483f3f
EUC-JP ???徇??誘?η?洹l????異??佾 0011111100111111001111111101011111001110001111110011111111001101101101100011111110100110110001110011111110001111110001111011101010100011111011000011111100111111001111110011111110110000110110110011111100111111100011111011000011111011 3f3f3fd7ce3f3fcdb63fa6c73f8fc7baa3ec3f3f3f3fb0db3f3f8fb0fb
UTF-8 銳얜끂徇쒐땟誘⒟η땟洹l난銳얇룗異⑨쭗佾 1110100110001010101100111110110010010110100111001110101110000001100000101110010110111110100001111110110010010010100100001110101110010101100111111110100010101010100110001110001010010010100111111100111010110111111010111001010110011111111001101011010010111001111011111011110110001100111010111000001010011100111010011000101010110011111011001001011010000111111010111010001110010111111001111001010110110000111000101001000110101000111011001010110110010111111001001011110110111110 e98ab3ec969ceb8182e5be87ec9290eb959fe8aa98e2929fceb7eb959fe6b4b9efbd8ceb829ce98ab3ec9687eba397e795b0e291a8ecad97e4bdbe
UHC 銳얜끂徇쒐땟誘⒟η땟洹l난銳얇룗異⑨쭗佾 11100111111001011011111011101011100001011011100011100010110111111001110011100111101101101010110111101011101011111010100111010000101001011110011110110110101011011110101010110111101000111110110010110011101011011110011111100101101111101110001110001111100100111110110010110110101010001110111110100111100011111110110011101011 e7e5beeb85b8e2df9ce7b6adebafa9d0a5e7b6adeab7a3ecb3ade7e5bee38f93ecb6a8efa78feceb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)