To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???娃?????汚??鈺??節??域? 00111111001111110011111110001000101000010011111100111111001111110011111100111111100010011001100000111111001111111111101111000100001111110011111110010000110111110011111100111111100010001110011000111111 3f3f3f88a13f3f3f3f3f89983f3ffbc43f3f90df3f3f88e63f
EUC-JP 饔??娃??旿??汚??鈺??節??域? 100011111110100011101111001111110011111110110000101000110011111100111111100011111100000111110100001111110011111110110001111110000011111100111111100011111110001111010101001111110011111111000000111000010011111100111111101100001110100000111111 8fe8ef3f3fb0a33f3f8fc1f43f3fb1f83f3f8fe3d53f3fc0e13f3fb0e83f
UTF-8 饔ㅿ쉭娃륅쉭旿덌슐汚억슬鈺싨볜節곈죺域묪 111010011010010110010100111000111000010110111111111011001000100110101101111001011010100010000011111010111010010110000101111011001000100110101101111001101001011110111111111010111000110110001100111011001000101010010000111001101011000110011010111011001001011010110101111011001000101010101100111010011000100010111010111011001000101110101000111010111011001110011100111001111010111110000000111010101011001110001000111011001010001110111010111001011001111110011111111010111010110010101010 e9a594e385bfec89ade5a883eba585ec89ade697bfeb8d8cec8a90e6b19aec96b5ec8aace988baec8ba8ebb39ce7af80eab388eca3bae59f9febacaa
UHC 饔ㅿ쉭娃륅쉭旿덌슐汚억슬鈺싨볜節곈죺域묪 11101000101111011010010011101111101111011010110111101000110111111000111111101111101111011010110111100111111110101000100011101111101111011011011011100111111111011011111011101111101111011011110111101000101011011001101011100110101110101011011111101111101111011011000011101001101000011001010011100110101101001001001001000010 e8bda4efbdade8df8fefbdade7fa88efbdb6e7fdbeefbdbde8ad9ae6bab7efbdb0e9a194e6b49242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)