To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 轣檎汚謇」迚伜ク戯轣檎汚謇」迚伜ク誼E 111001111000000110001100111001111000100110011000111001101000100110100011111001111000100110011000111001011011100010001011010110011110011110000001100011001110011110001001100110001110011010001001101000111110011110001001100110001110010110111000100010110110001001000101 e7818ce78998e689a3e78998e5b88b59e7818ce78998e689a3e78998e5b88b6245
EUC-JP 轣檎汚謇」迚伜ク戯轣檎汚謇」迚伜ク誼E 11101101111000011011100011101001101100011111100011101011111010011000111010100011111011011110100111010000111001111000111010111000101101011011101011101101111000011011100011101001101100011111100011101011111010011000111010100011111011011110100111010000111001111000111010111000101101011100001101000101 ede1b8e9b1f8ebe98ea3ede9d0e78eb8b5baede1b8e9b1f8ebe98ea3ede9d0e78eb8b5c345
UTF-8 轣檎汚謇」迚伜ク戯轣檎汚謇」迚伜ク誼E 11101000101111011010001111100110101010101000111011100110101100011001101011101000101011001000011111101111101111011010001111101000101111111001101011100100101111001001110011101111101111011011100011100110100010001010111111101000101111011010001111100110101010101000111011100110101100011001101011101000101011001000011111101111101111011010001111101000101111111001101011100100101111001001110011101111101111011011100011101000101010101011110001000101 e8bda3e6aa8ee6b19ae8ac87efbda3e8bf9ae4bc9cefbdb8e688afe8bda3e6aa8ee6b19ae8ac87efbda3e8bf9ae4bc9cefbdb8e8aabc45
UHC ?檎汚???????檎汚?????誼E 001111111101000011010101111001111111110100111111001111110011111100111111001111110011111100111111110100001101010111100111111111010011111100111111001111110011111100111111111010111111111001000101 3fd0d5e7fd3f3f3f3f3f3f3fd0d5e7fd3f3f3f3f3febfe45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)