To which bit string is a character (or characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??夜?????弱??厓э?節∽?臆?? 1001011011101001001111110011111110010110111010010011111100111111001111110011111100111111100011101110001100111111001111111111101010001101100001001000111100111111100100001101111110000001111001000011111110001001101100000011111100111111 96e93f3f96e93f3f3f3f3f8ee33f3ffa8d848f3f90df81e43f89b03f3f
EUC-JP 夜??夜?????弱??厓э?節∽?臆?? 110011001110101100111111001111111100110011101011001111110011111100111111001111110011111110111100111001010011111100111111100011111011010011000111101001111110111100111111110000001110000110100010111001100011111110110010101100100011111100111111 cceb3f3fcceb3f3f3f3f3fbce53f3f8fb4c7a7ef3fc0e1a2e63fb2b23f3f
UTF-8 夜쇽푵夜쇽푵若듸쉿弱놅쉘厓э푵節∽풚臆볢쑕 1110010110100100100111001110110010000111101111011110110110010001101101011110010110100100100111001110110010000111101111011110110110010001101101011110111110100101101101001110101110010011101110001110110010001001101111111110010110111100101100011110101110000110100001011110110010001001100110001110010110001110100100111101000110001101111011011001000110110101111001111010111110000000111000101000100010111101111011011001001010011010111010001000011110000110111010111011001110100010111011001001000110010101 e5a49cec87bded91b5e5a49cec87bded91b5efa5b4eb93b8ec89bfe5bcb1eb8685ec8998e58e93d18ded91b5e7af80e288bded929ae88786ebb3a2ec9195
UHC 夜쇽푵夜쇽푵若듸쉿弱놅쉘厓э푵節∽풚臆볢쑕 111001011010100010111100111011111011111010000011111001011010100010111100111011111011111010000011111001011010111010110101111011111011110110110010111001011011000010000110111011111011110110101001111001001110110110101100111011111011111010000011111011111011110110100001111011111011111010011101111001011110011010010011111010001001110010110100 e5a8bcefbe83e5a8bcefbe83e5aeb5efbdb2e5b086efbda9e4edacefbe83efbda1efbe9de5e693e89cb4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
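
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own code: the function name encode_row and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one codec.

    Hypothetical helper: characters the codec cannot represent are
    replaced with "?" (byte 0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    shown = raw.decode(codec, errors="replace")    # what the charset can display
    return shown, bits, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("夜", codec)
        print(label, shown, bits, hexed)
```

For the single character 夜, this sketch yields 96e9 for SJIS-WIN, cceb for EUC-JP, and e5a49c for UTF-8, the same byte sequences that begin the corresponding rows of the table.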